Amino acid segment containing signal peptide
(A=Alanine, C=Cysteine, D=Aspartic Acid, E=Glutamic Acid,
F=Phenylalanine, G=Glycine, H=Histidine, I=Isoleucine,
K=Lysine, L=Leucine, M=Methionine, N=Asparagine,
P=Proline, Q=Glutamine, R=Arginine, S=Serine,
T=Threonine, V=Valine, W=Tryptophan, Y=Tyrosine,
X=Unknown, *=Stop Codon, /=possible nucleotide deletion,
\=possible nucleotide insertion)
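The column legend above defines a small alphabet: twenty residue letters, X for an unknown residue, and three markers (*, / and \). As a minimal sketch, assuming the segments are handled as plain strings, the following Python tallies which characters of a segment fall inside that alphabet. The names RESIDUES, MARKERS and classify_segment are illustrative and do not come from the source. Matching is case-sensitive on purpose, so lower-case letters and digits left behind by scanning are counted as unrecognized.

```python
# One-letter codes as listed in the column legend.
RESIDUES = {
    "A": "Alanine", "C": "Cysteine", "D": "Aspartic Acid", "E": "Glutamic Acid",
    "F": "Phenylalanine", "G": "Glycine", "H": "Histidine", "I": "Isoleucine",
    "K": "Lysine", "L": "Leucine", "M": "Methionine", "N": "Asparagine",
    "P": "Proline", "Q": "Glutamine", "R": "Arginine", "S": "Serine",
    "T": "Threonine", "V": "Valine", "W": "Tryptophan", "Y": "Tyrosine",
    "X": "Unknown",
}

# Non-residue markers from the same legend.
MARKERS = {
    "*": "stop codon",
    "/": "possible nucleotide deletion",
    "\\": "possible nucleotide insertion",
}


def classify_segment(segment):
    """Count residues, legend markers and unrecognized characters.

    Whitespace is ignored. Anything that is neither a residue letter
    nor a legend marker (hypothetically, scanning damage) is counted
    as unrecognized.
    """
    counts = {"residues": 0, "markers": 0, "unrecognized": 0}
    for ch in segment:
        if ch.isspace():
            continue
        if ch in RESIDUES:
            counts["residues"] += 1
        elif ch in MARKERS:
            counts["markers"] += 1
        else:
            counts["unrecognized"] += 1
    return counts
```

A high unrecognized count for a segment suggests that the line should be checked against the original page images before it is used.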



AGSQKI^S WRRAASLLSKWDS LAPS 1TNVLVQGKQVTLGAFG 
HRBBVTSNPiiSPRVIQErirYYKOJTHDBRBAVIQQKLVIHIGHI 
ISHHPBLPSQTLKIHJgwriHAMBYWXOTOGPgPALDLYOLaP 
SEVKQIilJjllXiPCXiHGRO^XJfHKQIDGSLKRTPTGFYDRVWQI 
I^HTPHGIlVZVGKHLPQQPTLSWfWYKMHPSLLVEDTLGlJIDQ 
PQYRQIVVELWIVVSIVI^ENPBliBFUDECVDLOKLVKKAFHKFQ 
KDQSRLKBI KKQDDKTSPYWPPLGKEGTCS TLTOWNHLLLHG 
BVKFHHDDPGUES 



VQGYMYRFVIKIMSCYSKKTSI CHHROCPBLPArQPWPTPTVPP 
NIAIDSBSLGdXSFKLFADKV/PKRWKKHPVLUfTGfiKVLODK 
GPCPYttHPQ\LOQOGPPT BHMg T GGK SLVSKBPDDgHFl /LKS 
YAPGVLSDUiAflPTnSQSQPFICTAICTBDG^QHVWGRVKlXSMS 
IVHALK3SGSKW3KTSKKITAANCGQI> 



RRP PBGGSGGGSRTRABMPLPwSlJVLPLLLS wvaggfgnaasar 
HBGLLASARQPGVCHYGTKLAOCYG tnUaiSHGVCBArCBPGCRP 
eSCVGKiKCRCFP QY l GK TCSODVSBOOOCPgPOQPRCVHTHGS 
YKCPCLSCSHMI^IPIWrCTnSflTCAKIHCCfYSCEDTEBGPQCLCP 
SSOIiHIAPtKHUXIjDIDBCASaKVICPYNRRCVHTFOSYyCKCH 
rePBL(3YXSGRYDCIDIWgC3WD3HTCSHHflHCFWTQ GSFKC KC 
KOGYKGNGLRCSAIPENSVKBVLHAPGTIKDaiKKLLAHKKSMK 
KKAOK H VTPBmTPrPKVHLQPggYB8IVSRGGHSHGG\KKP 
MBBg>UEBG LBDBKRBBI OUJgp*MEHgEPFRG\PVPPPICVHEftCB 
PGIdL\Vt»KALTSB3iKHXADlKISVr)CSPNHG\ICDW\KQDR\ 
BDDFDW\CJPADR\DMAI \GPY\MAVPGmQOHK\KDI GRLKLL L 
PDLQTOSHFCliFDTfRLRGDKVGKLEVFVTOlSiaHALAWBKTTSB 
DEKKKTGKIQLYQGTCA3XBI1FEABRGKQKTGBXAVDGVLLVS 
GLCPD5LLSVDD 



ACFHXARRRKHVRSIjSRRIUJWlilARWSRGQWNMAARRITQBTFD 
AVLQBKAKRYHMDASGfiAVSBXIOFKAWLLRAVPRSRABHYDD 
VKSDGRTSLSGSVAHSRDAGRKSLRSDVFSGPS7R5SNPSISDD 
STFRKJKXSRDIiBKSHSHSRDQViaHRKUGHFHSQIJNKFALSGfiW 
BQDFGH PVSQBSSKSQBYSFG PSAVUJDFGSSRLI KKECLKXB \ 
SR0fDVIlHSG\HA\DSVLRGS\SQVOA\RGRAmiVDQEGSI»LG 
. KGBTQGIXTAiaXJVGKLVTIdDfVSTiaaPT^ITPKTQCrrHQI 
QHKTPS PpyrLGTHPQTEDIQPPIQKI PLGLDLKHUtLPRRKMS 
PDIIDISDVPSHPGI BIIKKAGPHTimDIKPSQIiPCmJKLBT 
gTC^SMUV3FKC31JCPgHRDFCFPTIgPTiKR3RT.TTPHVDWBPL 
NMLLDKGAVICr roCPPBI IKPFDgyiMgljQPHLLKSVTPIJ^AC 
HMfBL SV 10 0 C H «SMPIil>iJ^LAI«STPIS 1 J P U tS LALLGQlTFSIAS 
SPRQBKH«*AVGI^Ql>IAPSPAAFFNFBl>STLFGHBtIDHLKAHL 
VSSGCPLQVKKAEPEFMREBBKMIPPTKPBIQAKAPSSLSDAVP 
QRADttffVVGTXDQLVIRVIBa3I>9 PKKBTLLKEDPA YWPLS DEB 
SJUBYKfyxlJCIABHQBMSBNLRGADQKPTSADGAVBAHLYSRAV 
RHLKKKLLP\WQRRGLLRAQG\LfiJG\WKAKHA\TTGTOTLiLPLR 
APGLKHHGSQAPGLS\QAKPSLPDBHDVAAKb\CPLDPV\GP5P 
QDPSLBASGP3PKPA0VDISEAPQTSS PCPSADXDMKEMGRTAB 
KLAHPVAQVG\PBIEQP\S1\EN5TDWPDLNPL\HDQ3SS\APK 
Ff\RKKVPHLCB3ICgT SS PH H L\HTGQGDTT\GSQBfiPVDLMB 
GSABPHDBPPPRBAELBSPBVMPEBEDEDDSDGGBBAPA\PGRG 
GPSLBGSTPADGLPGEA\AEDDL/ALGAPALPTGLI4)VTCFFFG 
MPS3TSLKV5HIPAPKRVCIJQKPKVHBPVRIAYDRPRGRPKS 
KKXKPK0L0PAQQKL\TDtC\KLGPQ\MLQXKGIfKSGBaU3SIGK 
GIR\SHSACtQQAAHGGSGHGLSPSTCSLPLGSF l ZAECHAY5MQL 
IPVP 



I^PSGSLFPHAIO^DVK/ITrLNSLVVtTDTSGXVSVyKNDIXSRP 
wltcscqgmltcvgvgdvckkdkhiavavsakgwfhlfdltpak 
V 111ASCTHBTLIOTBQRPVFKQHI PANTttV KLI SDIDGDGCKKL 
VVGYTOHVVBAFraBKBLGKGPBHLTGQLVSIJCKHMLBGQ VDSLS 
VTLGPLGLPBUWSQPGCAyAUiLCrWKKDTGSPPASBGPTOCS 
/SGPPSCPRftGAAFPIin>YFQQBCLHSPHWQHQT\SHGraS9GS 
HLPUWSKVAGAAAMKPQBKBBlQVaAlfliULiraJtlFXrVZSV
ITPaTTBIIfiHLaEKHRVWJRITVTtVlKDIJCQKRSBTBSBAKTWJ 

IJLLHKSV?frSPAKl£STQSRXLHAL»VDSAVALETKDTSLASFIP 
AVNDLTSDLFRTKSKS BBIKlELRKIiKHLTATLVI^KKCLQraJV 
KKAHUn*STER>VuWIHraRCfiM\OFLKftKBBBPRPOIOAhOEQL 
SAHGQ\nAF5VPIQSLVAIJCRBSWPRLKQQTI PLK\ KKLH3YLD 
LMpWfi»CSEC*RIHEAK\HKLA\SXBAKLTRKVS\M«iiL 






LSHKLIEMLAISHH3B3iI>QM35HlffQVLSLLIHRDGHPQKIJi!KLA. 

l^QCgBHHrC?VLKKBVE]CRDSDIQQLQKOT«KR A BQ IT iMAVYQ 
ftKEKLKSfKgftRgGAl 33BRXI KYAH R I SA SH AVC71PLTSVPGD 
PaRpypTDIBMHSQjlXX 3 *tfflIPST!*OTH^U»POTMjft/RHKIJiR 
CPCSTVS/WGSQlyrC!l*XHlXI«II^QKSVCSL 















HmSSMSKSM^ B 




GTT?GGASBftCBD5tBftPGKBtRRtOFIiATWfL7FTD IWTTRGH LVL 
ALMfVRFYMBKCTHRGLYKS IQHTLKFPQrTPALLHrVHCXIGlV 
PTSVIVTGVQVSSRtFMVWI*ITHSlKPXQNKBSVVIjPLVRWTVT 
BYTRYSFYTPSIdAHLFYFI KHARYNVPI ILYPVGVAGKLLT I Y 

aalphvjocpomps irlpnbSemvspdyyyflIiIImasyi plppql 

YFHKLRQRRKVLHG\0*I>* WUfIK*3LQroCPPCiraaDYLSP3F 
NHKKKQWmSWIVWPLRI 



SJjHCLVWG(a/MPSS^PlX2KI«riAjU»REAH£?irSRLT«CSR 

SVI 3 KDK 33 SGWirPSW^I»U>SPVPIiRPPGPVQIiPSCIGItVLSD 

♦BK<R0RCS3/WLSQpqHHRBKEVVM«MSMMBCCU tf UUlSDVL 

CR5XAHRT8QLRBTLTAXRHHCQHLflKf3»nKBQHAqitHVGERSP 

DQSJgHTDGHTSVQSVIBKLQBEHRI»I»KQKVTHVRDLHAKJrQB.YN 

j^e ap HVV BgL aiMCTBLQI raSPBPiWCBIfiBMWOLBgKIISroC 
AZVBQSLAASIOSUmAAIiERVQMLBQOllAyKDDFMSERADRKR 
KQSRlQBLSBKVASUdKSVSMR^SBEPDACBIBMSSErAKSlA 
ADAIiSWfyPOGXRPGTGSQQPKPPABGOTPGAAQBGQGDLQCPH 
QjQCPSDKQGKBLLStHVRBCOQ 



SBVKKPGASVNV 5 C 7 VS 0 inrLTKL£MHQVll(U^PGSBLB*MBP 7 D 

IXJ 0 VBTIYPaKJ^JUKVSM r IRlfiSTRTTX^ATUBI> 33 IjH 3 BZn , AV 

HHCTtTPTy 

SGCVEMLYSHSLSYHPBW ISVQSfcVAPAQIAUiSDGDL* tilSGK 
RTRIdX*QiP3AGGPGLQSPIjQU5KLDXTSDBFILDKVDG\VDLR 
HYSKQVHLBLQQIBQKSlfiDYIQESKNIASLENQITXCnAVLBE 
KBQHLGRPOSDLSS XSSBIKTIX^QSGAKIJIBIJIHRQAVRGKLG 
klvdglvvpsalvtailbafvteprklkqlqkldaicaaavheqe 
IVR firAArADVHlWIJjPTiRVKAVTKIRBFXLQKlYSFRKPMlNYQ 
I pglALLKtIUTYQPIJjra®RATAKBIBDHYVETX*SKXYLSYYR 
SYUSIJCCVQYEBYRBKXJiaiWGVBXJrAKKBFPSiCPSljRSRIiTlF 
TUOTSSVISPT5I»BaPIiVPHDU2Ed5BQRYPPBALFRSQHYAL 
I jai 5 CRBYl«FICHPPVVSGPAJ^DLPHAV>CKrLSM i n*KHIiDSY 
UOXOTMAVPIXriMVUlWUrlAABHDVPAUIRTHKQVLftLLW 
PRPBIJLEMHVTOVRSTDPQRICGLDTRPHlf ITRR^AKFSSALV
SIKQTI PMERTMQLI/3QLQVEVKNFVLRV7AB PSSSKEQLVFLI 
HHYDI®fIiJVm\B*BRAADDSKEVBSFQQLIKAUTQBPIKKLLS 
PPFGOLVAPVKBAKAIiIERGaftERIxRGEEAHVTQLIKGFGSSWK 
SSVBSLSQPVMBflFTWFR HQ T S I I QGALTQLI Q VLTHRFHRV\X> 
SQPQIiEALPAHASLIHIHHIXVKLKKHJCPHF 



ggggQVQ)OOGSVCSLSPHKHRGLSLTBTgLBBLBAOVLQLV3USL 
BgragLAGQBHPaSLBL0GLL2DERI*ASAQQftBVFTXQX00LQG 
ELRgliRBBrSIiLBHSKESgLgEIBQRLHLflQ^BIQSLRQAAKDS 
ATBaBSPIASLQBPLCSMQHBLBDMKR IttGPgBMBIASmftPiB 
HKSagPSCBI^SLSPTSGBLQgKLQBiLKBKTHPmBgTttALQBSgS 
SLTGOLWJLBSKRTQRATBfijrLQSQTLSWTSAESOTSBKDFLB P 
DPED 1 QIiL£QQl£DAEBQHH( 9 CXVKCQBLCCSLEBI^BHRQVfiBB 
BQRKLOaBIjKCfi(»reVLRPQTSHS \SPSHPLPPIPPSS PCLL • A 
LWISALLttCHMABTSS 



OVIJJlASAaLRVPLTaVRACAKPBVClRBP. 
SRL3D0GD7TPSBR1^0PKGAKUIfFQCAXFJUlIAXAKSS(3lVPVV 
FVAGOBBQSTQPV^HBDOKVTEASSKSFVAIKIDTKBSACLOF 
sqiypwcvpssppigdsgxplbvxaosvbat^lvtrihkvrqsi 
HU aBBTSVAMQSQSB3SVSTPSASFBPtanxSMSQSEHRKLCB 
1 PSTSDTKSOTATOOBSAGHATSSQKPSG(SDQRPABDIjHIRVB 
fiLTKKJ gHR SBEK R XRR BQRHI KJCBIBRRKTQKEMLDYIQftKQEB 

bvs&axaaalxjusoabmbvkrbsyarbrstvariqfhlpdgssp 

raQPPSDAPLKBAHQPAAOTVGirtYQNPS LMWPRRBFTICRDY 
KKKLLDLBXAP3ASVVliIiP/ALFIHP*AGRPTASTVHS3SGDIW 
TIiLOTVLYPFIAIHRLISHFLFSNPPPTQTSVRVTSSEPPNPAS 
SSKSBKREFVRXRVLBK3lGODPKKKGKXyRIJrrQDDGKDKHNTW 
NGHSTOQM 



AAGGGRRSRX*SRSWPTGPSKSPSGVRCX3G\fiR\AHEDKDBFLDV 
IYWFRQI IAVVXOVIWGVL PIiffiFMDU3PCUIHRGVLYLYPSN 
YLQIDEBEYGCTVBliTKEOPKrsPA/ IVHGHLDHLliICHPl« # U4 
VTSSQVLPIQ3K3PS 



GtPGZiLSPBMVRl'UWTLKKBPVOTPTtCGDPELCTSSLPPXXHQ 

BVLLKALPLTVDPTMgVAfJCRIJCB GD T H MGQqVAKVVBSBBVAL 

PKGTIVI^SPGWTIKSISDGniliHKLLTBWHJTIPLSLADSTVG 

MPGLTAYPCLLK ICCVHSGBrTVHVKMiJ^ 

KVW^VGSDBKVAn£XDGFI7VV?ifYETVESLS£TliKICASPDGI 

IXJTFnNVGQBFSOTVIGWKKFGHAIOGAISTYHKTGPIiPPGP 

PPBIGI7QBlJZNBAFVVTSBQ(3ABQKAli£DIiI<KRVLBLPyFVX 

D*]jQAHTLVyKSMKSAKPSIiEYTSHKLVSO\KI QYKHYIIBCFB 

WAAPMGWiKCHWLGKTIVIA 



APGCRCCttAJgCPCKGPRWDSXjOPBAAaSPflAPGqAPQLIiSL&B 

RFDRGHPGGDDRGPQLHROSPG/SPSKL3RRPGPPGiiPQIxQGPP 

papglpqsrtl/pvlcvcdlspaqcdibccxdpdcssvdfsvps 

ACSVpVVTGDSQPCSQKAViySLHFIMIPPQRVFKLVDQINPSI 
FCIHITO\ # OT^fYPIjLIQKYL/HKCOCFjDTI/(!CrSDGPTIjHAKSy 
VSFTTKLDZ P17\AICYKYGVPLQTSnSFLR PPSSLTSSLCTCWNP 
AAFLVNOAVKOTUaHLEOCBBtEALSMAFYaSPKILRVPDSRK 
KVPITVQSIVlQSXJSXTIjlRHKDTDVIiQPTLVHflGHFSLCVNW 
LBVKYSLTYTOAGRVTKMILSFV1 iGTV 3SVVVPL0QKFEIHFLQ 
BHTQPVPXpSGHPGWVQLFIkAAGPQPHHOSGlIQTlBRYGQLTI 
LBSTTEQDCIAXiEGVRTPVLPGYTHQSQCKLRLTGALPCQLVAQ 
KVKSLI^GQGyPDYVAPPGKSQGP/ACMUJSfVPXHPITQSPKRK 
DSOQLPGALVTBVKWTKTCSmfPQAKIVHV'DVNLlSSSFPEAN 
SGNKaTILISTAVTFVDVSAPAKAGFRAPPAINARLPPHFPFPP 
V 
Predicted end nucleotide location corresponding to first amino acid residue of amino acid sequence




5080 


375 


2302 


LXCRTPGVABQjgAIgEQPStatPRClPSPRTPSWTPSAEAPWSPQ 
LSIfPOVETSGPBQVCSFLRHGGFBEPVLIXSIREHBCTGALLP 
ClDXSBFHmiGVSS^CSWaa^YIUHLVaiHVlTrHKVINDPIH 
CSX QfiPXI»VRIlDTPQF(9CLRYHQU3GGYYVFFQtSHHRFSH 
8r/3VGYliAGCIiVHMiGKKQPHLQXS3RDVI^C7Q IAGLCHDLGHG 
PPSBMPDQRFIPLftRPBVICIOTgQOSVIBCFBHMHSHDIKPVMB 
QYGLIPKKDICFLEEQIVGPLBSPVEDSLWPYIGRPEincgPLYB 
IVSHOHGTDVDKKDYPAADCHHl£IQMNPDTK3tFTKFARVCBV 
IJHBlJUCARDKBVO^YDMPHTHHSLHEtEAYQHKVGBlIDTMIT 
DAFLKADDYTBXTGAOGHEfSlSTAXDDWHAmCLlZQIXFIiBXL 
YSTDPKU^ARHTLKQIHYRHIiPEYV^TQPTGQIKXgKKDYKS 
LPXEVASAKPKVLUTVKZtKABDyiVDVXKMDTGMQBEHPIDBVS 
PYCffTAPtlRAIUTKKQVSQIi£iP\BKFABOVLIKVYCFCKVDRKS 
lyaVarqypvqiAcadr\het\kpqdgrcy< pptp *hpqkkcw\ 

HD3TFSPKIPTRI*PRHLPKSRV\QLFKDDPM 


1 


Ib5I 


731 


LPAtokinrriU^aOAP^RSG^HDI^PYPPOVFPPRPDKVAl - 

VTGCTDGIGYSTIAKHLAHLGHHV1 IACMHD5KAKQW3P KHtfT 

LKDKET * VIOiCCPGMXjCUINSSDPPTSASSGASTIGVHHHFLLK 

FGIPIL\DLASWTSIHQFVOIQPXMlEKI?IjHVLIHNftGVKMVPQR 

KYftDGPBEHPGUm^Ba^TOUdJmSKSGSPGHSARVVTVS 

SftTHTVREUDCDOLOSSACYSPHAAIAOSHlJUjVIjPTYBLQRLI* 

AflBGSHVT?\HVVDPgvnWTPIjrKHVF8IAXBlAHgLLGHIAPgrP 

D8GASVT5rXAAVTPBLBGVCGRYt*yHnCErnC5IH^YHQKDQQQ 

LJNSKSCEMIXJVLDVTL 


5890


1322


200 


PRRGtfSAAGHAVPVAFCSRISASSPaSPRGAVRLOSGTBAACRS 
GWOPRPA&AAGGHAGBRMSQROTIiVHLPAGGOGGTVGAXDTCP 
LBVVKTRIOS3SVTtiYlSBVQIOTKAGASVNHVV^PGPlflaWCV 
ILBKBGPRSliPRGLGPNLVGYAPSRAIYFAAYSNC!KEKliNDVPD 
PDSTQVBMXSAAMAGFTA2 YAYSPIWLUCTBLQL*' / 3QCTAGKK. 

r>gj^cvrkvyqtdgi^ptrgmsasyagisbtvihpviybsi 

B[QKLLBYKTAS7MSHDBHSVXKASDFVCM1LAAATSK\LVATTI 
AYPHKVVRTHiHKJKrX^YRSPPGTLS IXVQHBCm*SL¥Rai/rTH 
LVRQ1P\NXAXHMATYBLWYLX29G 




1322 


200 


FBRGWSAAflRAVPVAFCSRlSASSPRHPRGAVRLOSGXKAACRS 
GRPDPSPASAAGGHACSHKSQROIXVHLyAGGCBGTVGAIXTCP 
MVVKTRIAJSSSVTLTIfiEVQLHTWAGASVNRVVS PGPLHCLKV 
ILEXBGPRfiLFHGLGPNLVGVAPSRAlYPAAYSlJCKKItlAIDVFD 
PDSTQVHM I SAAMACPTAI TATNPIMLI XTRLQL* /SQCTAGXR 
JOtjAPBCVRKVTQTDGLKCPTRGKSASYAGISBTVIHFVIYBSI 
KQfCLLBYlCTASTHEHDBESVKBA3DFVlM1LAAAT3K\bVATTI 
AYPHEWRTRLBHBGTKYRSFFQTLSLLVQBBGYGSbYRGLTIH 
LVHQIP\HT7UKMATYBbmUiI«G 


5092 




379 


WLHV OGRLS VMSAV5SRTGGHSJU^TCA> i K}SW1VVIjGH 1»RGPA 
DaSWPQAAPCLSOaPHASAADVVVVBOBttlAICRAGttGGPKDT 
TPDKLLSAVMXAVLKDVHLIIPBQIiSDICVCSVLQPGAGAlMARX 
AQFLSDX PBTVPLSTVNRQCSSGMWVBSIAGGIHKGSYDOKjMA 
OGVBSMSLADRGHPGHITSHlMBKHHARDCHPW3rT3BNVAKa 
FGISRKKQDTFMiASQQKRARAOS RGCTQABIVFV1TTVHDDKB 
TKRSXTVTQDBGIHPSTTNEG1JUCLKPAPEKDGSTTAGNSSGVS 
DGAAAH*IARJRSKAHHDGLPIICVLRSfAWOVPPDX>l3IOPAY 
AI PVALQ KAGLTVSDVDI FBINE VAFASQAAYCVKBX2L PP*EG 
♦TOUXJASGP^(MPLGLH»GHVQViriAQ*S^SARGKSAYIlSGC 
PC7UGSWHGSPLPVPBYPWGT 


5893 


3 


1653 


IliSKRRCXJKAKTKKLllAKKVAVIGAGVSGLlSLKDCVDBGLBPT 
CPKaTBDIGGVHRPKBMVSIXSASIYQpVVHytSKBHSCPSDPP 
MPKDFPHFLHHSKLLKYPRiraKXPDIJUKYIQPQTTVLSVRKCP 
DFSSSGQWKWTQSHBKRQSAVPDAyWCSCHHXItroiPLKSPP 
GMBRFKr^YFHSRQYKHPDGPBGKRIIATEGHCailGSDIAVBLSK 
NAA(3VPX5T1llffiTWVHSRISBDOYFHD5VFHTRFRSMLRNVLPR 
TAVXHMXBQQMHBMPH&BHfGLBPQNKYXKHEPVUiDDVPSRU* 
COAISVKSTVKBLTBTSAI pbdgtvbehxdvi ipaotyspsfpp 



IgPSbVKVBHWHVSLTKrigPAHItPKSTLA CIGlJ QPLGSIgPT 
ABUQAEWVTRVFECLCSIiPSEBlfOMDI XKRHEKRUJLPGBSQS 
QTLQTHYVDYUniAI^CAKPDFCSLIJPKDPKIAVRLYPGPCN 
ST*YHLVGPGaNEGAfiHAIFTQKDRILKPlKTRAIiKDSSNPSVS 
FLIiKTI f?LLAWVAFP\0QLQK3 






pt.pgT^Q Qtrr^pgTirr rgTrrrJgTJ.V TJaQPi.T ILPXHLQARVQBH 
HPBSRECVVVVnJffiLQLDUJBTGQQVnPDQPXKQKXLVBBMAFl* 
KGVQBQQVRHECEVTKPBKEKGBBTRlBNGKliIVVTDS CGRVBS 
fiOPSgPMRAHNRTSm,BaHQRKPKBgIEYgCSEBB0RFIggiP 
IJBHASTHTGKKbCBSDVCQSSSbTQHKKVljS* ERKVTQC\BGV 
UJHAFURSSHLVBHQKIHIlaBKPyQQiBCGKVFSQNAOldiBKLR 
IHTXKKPYLCTHCGKMFRRSSKXJiIHHQRIHS0KKPCTCKBa3KT 
PSQALLLTOHQETHSHS KSRQOigCQKAFSl,TSDb0MHRXHTO 
BKPFKnflOQKAFHLNSULAQHyRIHNHBKPIQCSBCQBAPRQR 
SGLFQflQRYHHKDKLA 



HPSlliGAXPPFPPPSSPWPPPLYLmtSHRlCSRHFIKQHGIHaB 
KRLFVSDGVFOCLFVXAAAORAROBABVUSTVQPBDCWPFIiT 
HPKVPVLQU5SGMYLP9T3AICRYFF\IiL30MBQDDLTHQWIiEM 
HATKLQPTLSAAI»3fnj\VVQC3XKO\BDVlGSVRRTLTmPHSLS 
RQVtfCPFIAGBTBSLAD IVbWGALYPI*IiQI3PAXliPHBLSAI4iSW 
FOTLSTO\BPCW\AAHM»WIJCQ\QaviiALR\PYLQKQPQPSPA 
BGKGLSPIHPK2KHLATLSEBB IAHAVTAKBB7LBSLPPUIPQQ 
NWLPVAGBWIVbrrSALP YVKMVPHL®JI IGCVLSADVPARYS 
MJtepWTLYL OOTPg YGTATBTK AL\BEQLTPQKICDKYH1IHA 
DI Y \RVFNI S FI) I PORTTTPQQ\TKir\QDI FQGLLKRGFVLQD 
TVEQLRCBHCARP\LADRFVBGVCpFCOyBBARflDQCDKDQKLl 
NAVBLKKPQCXVCRS CPVVQSSQBlFI^bPKI£IOUiEBWl/^RTL 
PGSDWTTOAQFITPFFGFREHPSJCFRWQ^TRDkKXHGNPGTPfrR 
GFRDK\VPYVWFnATIOYLSrTAJmDQWBRWW \KNPBQVDLYQ 
WH \AKDNVP FHSLVFPSSAI/3^KDNYTL\VSHL IATEYLMTEDG 
K\PSKSaGVUVPRDM\AHDTGIPPDlSRFTI»\LYIRPEGEC\DSA 
PSWTDm*KNBS\BLlHNLOIPIHMt\CMFVBKFFGG\YVPBKV 
I/TPlX)QRliLA\HVTIiBXiQJlYllQ\LIi8XVRIfil2ALRdll/X13\RH 
(2JQTI \QVNBPH\RRIKB3BADRQRA0 TVTOIiAVNIAALLSVML 
QFYMPTVSAT IQftQLQLPP PAC5 IXiLTJTFLCTbPAGBQlGTVBP 
LfOKI*KMDQIBSLRORFGGGQAKT 3P KP A VVBrVTTAKPOOIOA 
MDBVTRQQflVRBLKAQIADKHBVAASVAKliljDLKKQIAVABG 
KPPBAPKGKKKK 



khmvsdgvpoclpviaaagrardrabvlistvgpbdcvvp flt 

RPRVFVLQpCiDSGHfLFSTSAICRyPFXLLSGVBQDOI/lNQVbBN 
BAXKbQPXL3AALYYL\VYQGK3Q3\BDVLQSVHKTIjTHXI)BSLS 
RQ\NCPFIiAaBTBSIADr7L»GALYPll«QDPAYlPEBliSAIjHS« 
PQXLSTQ\BPCQR\AARRLVI*KQ\QGVlAm\PTIAlKQPQPSPA 
BGIOX^PIBPEKKHLATLSBBKIAKAVTAHKKBLBSLPPUIPQQ 
KWLPVAGEHKVLITSALPYVNNVPHLGNIIGCVI^ADVFARYS 
HlRQW»TLYIXOTDBYGTA!rBTKAL\BBGLlPQKIC3)KYHI IHA 
DIY\RWFNIBPBiroRTTTPQQ\TKIT\QDIPQQU^RQFVLQD 
lVBQI>RCBHCARP\LADflPVKGVCPPCGYBERR(H)QCDXnJKLI 
HAVBLIKPQCKVCRSCPVVDSSQHLFIJJLPKLBKRIiBBPlIiCIRTL 
PGSIWTPNA)aFITPFroPRBWPSKPRKQ*TRDLK\HGHPGTP*K 
GFEDK\VFyWFIlATIGYl*SITMIY7DQWEKKH\KHPEQVDLYQ 
PW\AtCPSVPFRSLVPPSSALQAKPIWTl*\VSHLIATBYXJryBDG 
K\FSKSRGVGVFRI3H\AHDTGIPPDISBFYL\LyiRPBOK\DSA 
FSHTDU*LRHtfS\SLLHNlQVFIKRA\GMFV3K7FGG\YVPBtfV 
LTPDDQRLLA\HVTLELQHTHQ\LLKKVR1RDALRS ILTI S\RH 
GBQYI \QVHEPH\KRIKGSEAKtQRAGTVTGIAVIfIAALliSVP!L 
QPWPTVSATIOAai«QLPPPACSIU.TI?FUrnJPAMQIGTVSP 
LPQgLEMD QIBaLBO RFGGGOAKTSPKPAVVBTVTTAKPOQiQA. 








UO)gyfKjQGWIVRgXilCROKADKHHVRABVAKUIdJIt?UttjXjAVABG 

kppbapkgkkkk 


5097


2967 


66 


hPSlXGA!ttFIPPPSSPWPPPLYLFWHSHRKBRHFIMQRGIEGB 
HHLPVSDGVPGCLPVlAAASRARGBAgVXJgrVGPBPCVVPPLT 
aPKVPVLQLDOTSTLFSraAICRYFFVhliSGWEQPJLTNQMLBW 
KATBLQPTI»aAALTTli\VVQGKKGVBDVICSVHHTI»THIBHSI*S 
RQ\NCPFLAGBTBSZJU)IVX*rcALTPUK2DPAII^KLEALHSH 
FQrrbSTQ\BPQQH\AAHHLVI»KQ\QGVI»AI»R\PYMIQPOPSPA 
EGIgLSPIgPHBgELATI^HHBIA)ft,VTAHB3GS^BSLPPUPQQ 
HFVIiPVADHRHVLITSALPYVIDIVPHI/nri IGCVLSADVFAHYS 
MiBQHMTLYLUj liHt tGlMBTKAb\EEGl*TPQBlCDKy3IXIIA 
M7\H5IPHISPDIFGHTrrPQQ\TKIT\GDIP , QQIiI»HRGFVI<JD 
TVBQUlCBSCARP\l»AlJRFVEGVCPFCC7YBBJVRGD0CDKCGlCLr 
HAVSLKKPQCKVC^CPWOSSOHhFIiDIiPtliBKRIigBWr/grrL 
PGSDUTPHftQFITPFFGPREH PSKPKH0*TRDMt\B<3JPGTP # S 
CTB3S\VPYVHFDATIGYLCITANYTOQUKRIW\HIJPEQVDLYQ 
R4\AXEHVP PKSliVFPSSAI/lABDHYTUXVBHLIATBTIjWYHDG 
K\PSKSRGVGVFRTH\AHDrrcrPPDISRFTL\LTIRPBQJC\DSA 
FSWTftT r T -T.KMyff \pr VjniLffWPTWP A\f3<FVSKPPCG\ YVPEHV 
LTPDDQHLLA\HVTLKIjQHYHQ\UjBKVRIHDALB 3 1LT ES\RH 
CMQnfX\QVHBPIf\KRIKSSHADRQHIU9TVTGIiAVHXAALIi$VMIi 
QPIHPTVSATlORQIiGP^PPACSlJjLTNFIjCTTiPftGHQXGTVSP 
XjnQKliBKDQlBSIiRQRFGGGQAICtSPKPAV VBTVITAKFQQIQA 
IMBOTKQGSnVRBLKAQKADKHBVT^AKVAKI .LDLKKQLAVAEG 
KPP2APKGKKKK 




U 29S7 


06 


HPSliX5toFlfPPPfiSPHPPPIiYIiFWHSHEXSBHPIHaftaiHGB 
HRliPVSDGVPGCXPVLAAAfiRAROBABVLISTVOPBDCVVPPKr 
RPKVPVLQlJ5SGNYI^3T3AICHTFP\hLSOHBQDDLTNQWLB¥ 
BATBDQPTLSAADYTL\VVQGKKG\BDVljG3VRItTLTHIDHSIiS 
RQ\HCPFIAGBTBSLADIVLWQALYPUiQDPAYIjPBBli3AIflSII 
FOTI*STQ\KPCQR\AARHLVIJCQ\OGV1 »AIjR\PYLQKQPQPSPA 
BSKQLSPI BPBBggLATSSgEBIAMAVTAHgKgliBSLPPLBPQO 
in^VLPVASESNVMTSRLPYVHNVPHLGNI IGCVLSADVPA8YS 
RLRQWm.YLOGTDEYOTATBTKAL\KEGLTPQBICDMHIIHA 
DIYNHHPHISPDlPGgrfTPOQXTKITXQDlPQQ^bmOFVLQP 
TVEQLRCBHCAHF\LADHFVEQVCPPOGYKBARCTQCDKOOKbl 
HAVHIJCKPQCKVCRSCPWQSSClHLPIDIiPKIiBKHLBBlffhGKTL 
PGSDWTPHAQPITPFFGFHBHPSKPRWO*TRDIJ^WIHPGTP'*B 
GF5DK\VFTVllPnATIGYLSrrAirfTDGin5BIW\KBPEQVIHjYQ 
PH\AKiiNVPPHSLVPPSSAI/^KDHYTL\V5HLXATHYI«HYKDG 
K\FSRSItf3VGVPBDM\AHPIGl PPDX SAFTL\LYIRPBCK\DSA 

PSirrDIlJJWKS\BLLHHUaiFlMRA\GMFVSKFPGG\TVPfiMV 
LTPDIX2RIJA\HVtLBLQHIHQ\I«LBKVHIHDALaS 1I^I8\RH 
C3»QYI\QVHBPW\KHIJC3SHADEQRW3TVTGLAVNIAAI»I*SVML 
QPXP1PTVSATIQAQDQLPPPACS UuLTHFDCTl^AGHQIGXVSP 
IJDXLBKDQtBSLRQRFGGGQAlCTSPKPAVVBTWTMCPQQIQA 
MDBVTKQGBIVHHLRAQKADXNHVAABVACLLDLirKQLAVABG 

KPPRAPKGXXKK 




326 


1070 


KCPKSfcBPHGVHAPSLPSPIJlAAHALSDVDVXKQIKHMMAFiEQ 
EAHEKAHBIDiAKAEKBPHIBKDW.VQTQBI»KIHBYYBKKBKQXB 
QQKjaXKSIMRHQARIXVLHAHMDLISDLLSKAKLRIiSRXVBDP 
BVYQGLIjDKLVbQGMiELLBPVKIVRCRP \0DLLLVEAAV0KAI 
PBYtfnSQKHVBV\OIDICBA*LAVBCSWBVHBVTSGHQRIKVSK 
7LBSRll)LSJtfQQ^MPEIRMALFGAHTNRKFFl 


1 


64 


14^5 


KRASM>5PCI^PCBl»OOVSSHDI<giaWYHHl^HLHSHL0PLLK 

GGVIYPALPQPRFKSLLPLAVHHHBTASKSLTCWiQGHEDHPKL 

KtAirrVMHFOTVIfLEDHClRSASCiHSKXBQIlSLOTASVDIXZIKP 

KnW*DBTTIiPPTWFIX5HVTKHnJ3liLViaiSTBGQKQKVIQPai 

LWNABIYQQAQVPSVDCQSFLBThBGIiKKFLQIJFIiLyGIAPVEN 

VPPTQHITBKLASBISLIRfiTlYGIUWYFTSDFSSiaJTArTKLA 

LJjHHTDTTYFQBPOGIQVFHCXJfflBCTOZBTtiVriGFYAARQVL 




VAXBQTSLKWQAVGGftPARPTQBYICtrQOQ RKyTSUJSF QTHL 
n vrim n.p KLTCPQCiJKBFPirQKSLLKHVTIHFICTSTYY I CE 
SCDKQFTSVDDLQKHLLCWHTKV ^FKCriX3QBVPDSKVSI0LHL 
\AVKHSHBKKVYRCTSQWDPRNKnJLQIJIVKHNHLBSOGKVHK 
CIPOCBSFGTBVBLQCBlTTBSKSQaiCKFCSKAFHAlIIXBXBL 
HEKHWFBTKTPHCXrr»GASBaVQKBBVM^m>TKSQBSHHSH 
BGS8HDVDT8BPH?GCDIGBAAyiWBTIiIiQ)H(PjRIIHHIRP0B3 
AIVKKKABLIKGNYXCNVCSRTFPSBIJGIJIKHMQrai/JPVKHYM 
CPiaSBRFPSId^TBHKVTHSlCSUmaraaCKMPUlSBBBnf 
Kff03WHPDLRBSLTGF3K^V05CrtVI^LKUatBGTFHMQKTGM 
GSAWBTTGRGCJHVQKbYIDCASCIiKBPRS XQnbVKLDIHGLPYGL 
CMJCVHI*SKSASP0IHVPPCTHRPGU5QHEHI»SA1E0JCGKVGGI* 
CTRCS*IATFKP*VIJCVBLPBI*HPKPyHBGVSRK)SBSTQiLJCTP 
QVS P*n> PTQ PSQSDXICKTYQCXKOQNVFYNBKDIQVBYAHHKI d 
J5GLOTBCKLCEQTEIJSBAKLQCHLIBHSPBGHGOTFKCPVCFTV’ 
FVQAHKI^HlFSAHGQBBKiyDCTQCPQKFFPQTBLQNHXt'frQ 

H3S 

!LKHRRRSRP3IRQSI&ST3V3KKIjT31jFTYID1H ADVQ*V*RB F 
IPL3CPRO*BD*MFOSWUIAIWnyriiBBIkFKQC3UW>W'OYHTiyroT 
VKPLCrVRVETQGDDI^IXPHFLDBHLYKFSADBPPI P\GWGB 
BFS^KHPQGTBVKAITIBAMQVYMBBMPBVFVirDI 



DT 

PALPALSAVPGOAA9PV!PPSGIiELIiPLIiIiPLL>fI»LVLTPGRP7UV 
OLSTCKTIMBLVKRKRIBMRGQIIjSKIiRIASPPSQGBVPPGP 
LPBAVUU,YNSTRDHYRQBSAKPB PBPBADTTAKBVTRVLMVET 
HNBIYIUCPKOSTHSITMFFHTSBLRBAVB'BPVIiIiSHABLRIdJU/ 
KXJCVBQHVEL YDKYSMNS WRyL3NRl*LAPSDSPBWLS FDVTGVV 
RQWI*SHGGEIKBFHI*SAHCSCDSRPNn^VDIHGPTTGR\ft(3)I* 
ATIHGMMRPPIiLI/ttTPLBKAQHLQS\SRHROAL\0XHY\CFSP 
HGOMICLRC/VHC*HLlFRKDI*\GII\KMl\HK\PKrnHABFC\L 
GPCPYIWSU7TQYSKVLAI>YNQ\HKFG\ASAAP\CCVPQALBP \ 

LPXVYY\V0RKgKVBQL3HWXVBSaCC3 



M KBB Xtf T? KBBOSI«r YBBUBBBlCITMtf 

AU»SBTKHAFRAISSKVPVDKVTPSTI^KVIZ)PBKkU)QTOG 
HQGAtfDOYDBQNPVKVHMKUKGKPTPllBBVLBHLPGKTQDBVQQ 
HBJWYQKPLiaBKRKKKS IQIWKXnCQQKRBBIFKLKBKMSITP 
VLPHHRQBDHQICQKBBQHKKQKlAVBWfKHQKSIBMSMRCASQL 
KBBBBKBKKHQXBROBQFKLKLLLBSTIQQKKBQBBFLRI^KBI 
Bffl^Tffifti jfrftT nnuuraTgqppi^nijnrrJgiglLPRQAKHDBlCSQ 
KQRRIAKLKBKVEHNVSRDPSRLY/lJTHQRLGHTlitPOnRTHRIiW 
ATSTTPr*GTSHLBTHNTBKSMR ____ 



Tggpppg^BJV^STlSSZAPaJ ^PDKKOaifiNWTVAFAP 

DGSYFAJiSQGHRTVBXVWiSQCXaifFLLHSTKHVTHSSSLBL?a 
Q}tfiDGG<S®KPBSfIIDOQDIVHSIAFGSSVPBKQ3HCVHIBMH 
RPRFGQDQIilATGLNSOHI KJHDVYTGKX»LIlfLVDHIGVVIUlL 

TTAPIX5SIiILVSASra)KTliHVWDLEDDGN\WKVLRQ?Q l HWVY'\ 

SCAFSPDSSMLCSVGASKAWAMLV* LRLCTHHSHT3AJMVL3 
HAKRVASIATGLGAirriG^SNIJU^QGVLyVHRCWSMSTFCP 
SPFLPFFPKVISPTVKYH*IaLSKJL2FQFYGIGSLTSSTHLM*SI 
NLSHGPSVLPPOII»SDSRDILRI»*PMLKPVIiIFF # K. # CIVSVQK 
KKKPKEIAUiQBKHLS*DKPP3Sm»I*0TBVHIRILPRAILH3* 
UiIFRI^NCI *TYS*I1DPFYIQMTYD31G*FGK6KMV1CF*PIBW 
*LYYFHKIAPSFaiW*HPCCLPKKFHtAVHXLEACSICFSS*A 
QVGt l P^t^M > gTmf<^C0Hg)Sffl^ , nAPIiSvir^iaa«VV8OKIC 
RBGGL*YI*TLF1SVYFS* LVPGlKGPOYSPWKbHCLYP MFHb l 
FKLTFWBWl*liffl.IC>SAUmJCTDPHLT>fffLSlFPIClJ«XX YIDt > 

YNUI*I •QP*YKHCMPVI^CW3B* , fllMlQiPI7> QP\mHMDgYnt 








^KIJSMdnmRCDPSPDCUM^ 

BFGKLFPPPTPIFAGGANDBWVRSVSFSHDGLHVRSIxADDKMVR 
PNRIDRDYPVQVAPLSNGbCCAFSTDGSVIiAAGTBDGSVYFWAT 
PRQVP3LQHLCRMS IRRVMPTQEVQKLP I PS KLLKFLSYRI 


5906 


146 


2038 


RBGAGSGHMftSGA\YNPYIBI IEQP^MiRFRYKCHdftsSGSi ' 

pgehstcnnrtypsiqimnyygioskv\ritlvtk\ndpykphph 

m,VGKDC3U)\GYYKAKPC^P8\RHP\LFFQN\liGIRCVKKKBVKE 

A\I ITR\I KW3INPR>VP^1CQUJDIEIX3)1jBWRMJ PRVFLPDG 

HGHL\TTAU>PV\VSSPIYCNRAPNTAErJiVC3rVWKNa^VJ^ 

DBIFLLCDKVQKDDrBVRPVIiNDWKRKElFSQftDVHEGVAlVPK 

TPPYCKAITBPVTVKMQLBRPSDQBVSBSMDFRYLPDEKDTYQ!! 

BAlKUHlTIiLPQKljCQIMVETGFPHVDQDGLELIiTSGDPPTLAS 

QSAGITVNFPERPRPGIil/3SIGEGRYPKKBPNLPSnDAWRB4P 

TCVSSQABSYYPSPGPISSGLSHHASMAPLPSSSWSSVAHPTPR 

SGNTNPLSSPSTRTLPSNSQGlPPFLRIPVCSNDIiNASNACIYNN 

ADDIVGMBASSMPBADLYGfISDPHMLSNCSVHHMTrSSDSMCTT 

DHFIO^SMiLBIPSQSSVLDPRDLRQL^MSSSSMSAGABSNTT 

VPVSQSDAFBOSDFSCADNSHlMSSGPSNSTfiPHSHVFVQDSQY 

SGIGSMQPBQLSDSFPYBFFGFV 


5907 


99 


1873 


TTLbSSWSS * K^KV/ttW&KKI^WP YPQPAKQNGK 

KATSKVPSAPHFVHPNKHANREAKLXKiaWHBPCRBKQOAAREQB 
RQKRRTIESY CQDVLRROBBPBHSQBEVIiQBLMMFPQIiDDBATRR 
AYYKEFRKWKYGDVTIiBVllDARDPIjGCRCPQMKKAVIiRAQffiOC 
KLVLVLNKIDLVPKHVVBKHIJTYIiHNSLPTVAPKASTQHOVKNIi 
NRC3VPVDQASB8LLKSKACFGAB191iMRVIiGHYCRLGrBVRTKIR 
VGVVGLPNVGKSSbINSLKRSRACSVGAVPGITKFMQKVrLDXP 
IRIiHlAPGIVPGPNSBVtJTILBNCVHVQKIiADPVTPVBTILQRC 
NLBEI SNYIGVSGF0TTKHFLmVAHRI13KKKKGGLYSQBQAAK 
AVLADWVSGKI S PYIPPPATHTXPTHIjSABIVKEMTBVPDI EDT 
BQANEDTMECLATGB8DRLIiGDTDPIiEMBIKLLHSPHTKlAIltAX 
BNJTITVnrKIGDLTGYCTNPNRBQMGWAXRNVDHRPKSNSMVDVC 
SVDRRSVLQRIMHTDPLQQGQALASALKNKKKMQKHADKIASICL. 
3DSNMSALDL30NAZHX3VGD 


5908 


247 


975 



HOGI KKRGEGSGSPSPASGGFQUJCQI P3PSLPS BKKTri^HTRA 
BTRTLRATLTRRPPttSHSTRI^FPMPIJXaXSGLASHK/PSroKR^ 
GtfRH PAKAAGASIUVAATGKRGCRMSKRYLQKATKnKLIiI 1 1 PI 
VTLWOKVVS3ANHHKAHHVKTGTCBVV7VLHRCCNKRKIBBRSQT 
VKCSCFPGQWAGTTRAAPSCVDASIVEQKJWaiMQPCIiBGBBCK. 


5909 


1 


5002 



PAIPG3TI IWAPG5HSAARADGR»3SI>PSQ3QAPGALCGARAPP 
SSNLRADR SMI CAQARAGKNLYHNR FLGLAAMAFPS RNS QS LRR 
CKRPIRYSYNPDQPHNMDLRGGPHDGVTXPRSTSDTDLVTSDSR 
STI/«5RSSYYSIGHSQDLVTHWDXKKKVDA0DWIGMYLrDBVLS 
KNFLDYKNRGVHGSHRGQIIWKIDASSYPVBPBTKICFKSfXHGV’ 
SGALRATTPS VTVEHSAAPIFKSIGADBTVQGQGSRRI*I SPSLS 
DPQAMGUaCGMFPNPDPYLKISIQPGKHSIPPALPHHGQKRRSK 
I IGNTVNPI WQABQPSFVSLPTDVLB IEVKDKFAKSRP I IKRFL 
GKLSMPVQRUjBRHAICaMRVVSYTIiGRRIiPTDHVSGQljQFRFRI 
TSSIHPDDKKISI>STKPESAQIQnSPMNNLMSSGSGKPR3HAPE 
SSBSWKPEQLGBGSVPDRPGNQS IELSRPAKBAAVITBAGDQQi 
VSVGPEGAGELLAQVQKDIQPAPSAEELABQLDLGBBASALLLB 
DGBAPASTKEEPIiEBBATTQSRAGRBBBBXBQKKEGDVSTtiBQG 
EGRLQLRASVKRKSRPCSLP l V3BIiB , IVIA3AC5GDPBTPRTHYTR 
IBTLIjBSMPSAQGGSAABEBDGABBBSTIiKDSSEKDGLSSVDTV 
AADPSAIiKKDRBEPEGATPGTAHPOBGGHFPS LANGAAQDGDT 
HPSTGSBSDSSPRQGGDHSCBGCnASCCSPSCTSSSCYSTSCYS 
SSCYSASCTSPSCYHGNRPASHTRPSSVDSAKISBSTVPSSQDD 

ehkensafbsvpdsmqspeldpbstngagphqdklaapsghvkr 

SPSGLKSPVAGPSNRRBGBCPimNSQPVSQLPSLRPKHHHYPT 

IDBPLPENWRARlDSHGRVFYVDHVNRTTTOORPTAAATHXa® 

RSG3IQQ^BQU?RRY0IIIQimATBRSKKDSGS0SCBQAPAGGG 
QGGGSDSBABSSQSSLDLRHBGSbS PVNSQXITLLLQSPAVKPI
TNPBPPTVLHANywyRVFTSSTCIiKHMILlCVHRDAHWFHRYQH 
NRDLVNPIKMFAinra^LPRGHBI BCTDQQGICSFPVDBNSRATTP 
IDPRJ PLQM3RLPHHLTHRQHIiQRLRS YSftGBASKVSRKRGASL 
lAHPGfiSLVAAIHSQKQHBSbPIAyiSDKIVAPLRQPfillFBMXiQK 
HQPSliARNHTLfiBXXHYIRTBtaHGLKKLSCDADLVILLSLFBB 
BIM5YVPL0AAPHPGYSFSPRCSPC6SPQHSPGLQRASARAPSP 
YRRDFKAKLHHPY RK IiB AKn FGQ GPGKlKL IlRRDHXJ.BOTPHQ 
VMAYSBKBLQRMKLYVTP V G BB gLDYSGPSHBFFPIJiSQKIiFNP 
YY6LFBYSANDTYTVQISPMSAFVBSHLSffFRFSGRILG\LALI 
I^YI»IiDAPFr\aPFYKALI*\RI*PC\D\LSt>I*HTLDKKFHOSLQW 
MKDWHirblLDbTPTVliggVPGO V TBRgLKSGBlHTgfWT BKliK K 
BYXBKNVKNRVERGVVGQTSALVBGFYBVVEtSRLVSVFDASBLE 
LV IAGTAE IBIJCDSRHNTBYRGCrnnXMLVrRWPWAAVERPNHB 

®I10AQPVT(?rSSVPYEGFAJ^PWBPMGiaRFLP ^ IQCHOKTrS 
M>PRG\BTCMPDWM*5TVSPRTPMLYBK\LLTA\VBBTSTPGT 



VABFAAMSPGRTQircLDPRYTADLLBVZiKTOYOI PSACFSQPPT 
AAQLLHAIXJPVBIAIiTSXLTIiIiAIiGS IAI FLEDAVYI>YKNTliCP 
I KRim^XSSAPTVVSVLCCFGLNIPRSLVLVaiTITS PYAVC 
FYLI^4LVKVBGPGG!CKAVI±RTI>RinT > ftMVH , It3PCCCCCPCCPRI> 
LLTRKKIjQ\R*Ci«ALffl?rPS*R*R*PKWACPSSPTASMTQQTFL 
RGAGI>Y6STLSSA/CSTZiIJlU7TbQI ISRQARLULGBQnMQAKF 
ALPCfVLLILTALQPS rFSVIJa?GGQIACSPPY3SinTtSQrVI«TCH 
LLXIJBfTPMirVLTRMYYRHK D n gV g YB TPSSPDIiDliMbKMiRHM 
AJrtMKOCCTH 



QuPLAPClQGKGItBMaSPKPQSBlIRSSHSGAGLbVKNPSTPVP 
OGHRRGGAAFKYKPTPVVGPEQRPTGQKHMRGGVSLLSPRLECS 
GTISABCNLRLPSSSKSPAPAS * LAOIltWCHHRQLl PV7 LVBT 
GPHHVGQAGLELL/RWXHLlPRPPKVLOLQA 



MILNKAI^LGAIJU.TTVMSPCGGEDIVADHTnSYGVNLYQSYGP 
SGQYSHBPXJGDEBFYVDIiERKBTVWQLPbPRRFRRPDPOFALTH 
IAVLKHHXJJIViKRSNSTAATHBVPBVTVPSRSPVTLGQPinT»I 
CLVtHSIFPPWNITWLSSSOTSVTEGVSETRPSSPKSDHFrjjQDQ 
VTSPSFPFB* *DL*TAKVEQI£AJ9PBPIXKHWGAHIPTTI> 



QLRMAGAEGAACRQS HLKPvvSLVDVIiBBDKKIjBNEACAVLGGS 
DSKKCSYSQGSVKKQALYACSTCTPBGBEPAGICLACSYBCHGS 
HKLPBLYTKR 1TPRCPCGMSKPKHLBCT1 «T.PDKAKVNSGNKYHDN 
FPGLYCICKRPYPDPBDBIPDBWTQCWCBDHFHGRHIiCSAXPPB 
SGDFQHKVC^U>!KR< 3 PIA 4 AYAADI^VTKIST\© 01 ErffCGTXM 
B*/DJX^VIKPB»KSHQDSTTiKKDVPBQGKDErVRBVKVBQI«SHP 
CAGSSSBSDLQTVFHNESUiJ^BS2CSGCKIjQKLKAKQLIKKUTAT 
YWPUWRSKLCTCQDCMRMYBDLDVIiFI.TIlSYDTVLAYBNKGKI 
AQATDRSDPLMDTLSSMNHVQOVKLI C/G IQ* FED 



HLGGSE3LPPKKALP3QVASMNQHRVDFYI*A3IKDMI#VAX/GGRN 
KHGALSCVBTYSPKTDSW3YVAGliPRFTYGSAj3TXYKDFVYXSG 
GHDYQIGPYHXNliLCYDHRTOVHBBKRPMTTIUlOVHSHCSIiSDS 
XYSIGGSDDB IBSMBBFDVLGVBAYSPQCHQWTRVAPliLHANSB 
sgvavweghiyilggyshkntafsktvqvydreadkwsbgvdlp 

KAIAGGSACPIAP*SI^RTRKRKAKARGTRTGaSDPSCASWI3H 

PHBHLPGIiCRPAATS 



FPGRPTRPLKLGRRRKRAHI IQAPHOISPRPfiTCPPGALQAPBA 
PASRAEGPVAVWNOHTBGPAPARSAPKEPPGLPRPI>GSFPCPT 
P'QKDFPALGGPCPPHMPPSPGPSAWLLKGTPPPPPPGLVP prs 
KPPPGFSGUiPSPHP\PVSPAPPPPPPQX/RPRLLPAP/PGLPS 
PRELPGHSPSAHPVHQGItPAERHGPIjQRVQBPLRGVQTGPDLRS 
PVLQKLPGPAGGBFPBGI.* *AAGPAAH 



SPRMWBIWGPHHRWEEPSLKGBWPSRIPKPSPDSTHGTSGKGCB 
TVTCAVHHHIJSiHVAGIIPWVLHSQIiKPTAAT3U^>QWTSQQYPDH 
PTRLILQ* NQATADSNK* TTALLQPHQRL\ VS PHXAEA 



AHQI LTYLBP/ iCLWNYMKir.TVFLTKSVLSI *KFIHT 



iMIl 
?* NDPPGIKBVYVSREtLRKTS F/RUWTFLBQAWS KBCVPVDQ
?MEBL£J^liLSlJWSDPVPNVRVLIJ^KALSQMLtIjBKAYPRNA0KP 
KLBVXSET ILALQSDRDQDVSFPAALB PKRHHI IDTAVLBKQN 



BGAQVARRR3RRQWRAGRC6EGRQGSUtAKRTGGRGPPGRPRPl»P 
PGPAHRGHRRKKTPPYGDBALSGU3GGASGa3GTFASPGRLFPG 
A?PXAAAGSKMKKD^TLSLSBQVAAAtJCPAPAPASYPPA\ADG 
A?S3\APPDGLIASPDI/3 LLKIiASPKIiKRLI rQSKGLVTTTPTSS 
QFLYPKVRRSEBQHFAEGFVKAI»BDIiKg(yiQbGftQBAAftfiftAAR 

aggpsgtatcsappgbiapaaaapkapvya\nlssy\aggcrgl 

SGGAAT\VAPAABPVPFPPPPPPQAbGPHRP/RIjRLQGRHPQTV 

PDVP\SFGBSP\PLSPIBT\DTP5RI\KAXRKRIj\RNPQIHAPK 

PA3RKLGAQSRALKRKSEDPS*SPKFIGSLASTASLIJIKQVAQLK 

QXVLSHVHSQCQLLPQHQfVPAY 



TSVQCTS0OTPTSSQGSINMKHWISQAIHGSTTSTTSS6STQSO 
GSGAAHRliADVMAQTHIBNBSAPPDVTTYTSBHSIQVERPQGST 
GSRTAPKYGHRKU<KTraX3VPVSSRV3ftKI0QLVMTLKRPKRPP 
LRBPPVDDFBEIJjBVQQPDPHQPiCPBGRQMIiAMRGBObGVVTMW 
PP3LEAALQRWGTISPKAPCL ITIim’NUKPbYIIiTYGKl^TRSM 
HVAYSILHKliGTlOQKPMVRPGDRVaitVPPtgJDPAAFMAAFYGCL 
LAKVVFVPIKVPI/IHKDRGSQOIQFL1CSOGVTVALTSD7VCHKO 
M*XSPTC51PQFKEWPXLLWFVTRSKHLSKPPRDMP\PHIKnMf 
NDTAYIKV JfcQt^\DGSVIiGrVT\rTRTAIiTHCOAIjTQAOG YTHAK 
TIVMVUJPKKDVGLWHGII/rSVMNMMHVISIPYSLMKVNPIiSWI 
QKVCQYKAKVACVKSRDMHWAI»VAHRDQRDIHI*SSIiRMLrVADO 
AliPWS ISECDAFUiVF'OS KGLtRQKVT CPCAS S FEAL.TVAIRRPT 
DDSKQPPQROVl*SMHGliTYaVI RVDSKKKLSVLTVQjDfVQFbVMPO 
AI WCSVKPboVPQbCRTDKIGHBIjCVCAVATGTSYYQLSGMTKNT 
PEVPAMTSSGAPISEyPPIKTGIiLGFVGPGGIiVPVVGKMDGCLMV 
VSGRRHNADDIVATAI»AVEPMKFVYRGRIAVPSVTVLI£DERIVI 
V7VJEQRPT5STEKDS FQSJMSRVLQM DS IHOVGVYCXALVPANTIJ? 
KTPIX?GIHI»SEnCQLPliBGSLiHPClJVI^tCPHTCVTNIjPKPRQKQ 
PBIGPASVMVaHLVSGKRIAQASGRDLGQIBDNDQARKPIiPLSS 
VCjQHRAQTTPDill LYTLLMCRGAIAHS LTCVQIiHKRABKIAVML 
KHROHIjCDaDHVALVYPPO IDIJAAFYGCLYAGCVP ITVRPPHP 
CKIATTLPTVKMTVKVSRSACXmi^LICaa*LRSRBAAAAVDVR 
TYfPLILDTDD * PKJOU»AQlCKPC8PI3TLAYIiDPSV3TTT3 «IJU^ 
IWSHAATSAFCRSIKIOCBLYPSRBVTVICLDPYCXSIjCJPVLWCLC 
SVYSGHQSHI P PSBLBTNPALHLLAVS Q YKVRDTFC 8 YSVHBL 
CTKGI/3 SQTKS LKARGLDGUSRVRTCVVVABSRPRIALTQS PSKL 
FKDIiGLHPRAVSTSPGCRVHIjAJ CLQGTSGPD pttvyvdmralr 
HDRVRI.VERGSPH 3LPLMHSGKXLPGVRIXIANPBTKGPLGDSH 
LGBIHVHSAHHASGYFTIYGDBSLQSDHFNSRLSFGDTQTIHAR 
IuniGyLRRXELTDANGERHDAbWVGALOSANBLROmYKPXD 
IKTSVIRAHKSVTBCAVFTimJLLVVVVSLDGSBQKALDLVPIiV 
TfAnnURKHYLIVGVVVVVDIGVTPlHSRGEKQRMHLRDGFLADQ 
LDPIYVAYNM 



QLGAVAHAGVSRI PP*bFPPLHPTFL3LWCLHHKI*P/HPPGA5M 
VRPFWPRRPPAH I SSVRQASTQVPRTVPHTQHVAHIGTQTTC3P 
SGVGCCTPGRPLLPCKCSSAAHSTYRVQBPAVHl PGQEPLTASM 
LRAAPLHEOEUHIGKRLYPIxDfDVHTQLAGKITGHLLKlDNSRL 
LU4LBSPESLHAKIDBAVAVLQAHQANBQPKAVMH 



VCPGTGGB *GLKGQLGGI*PKBTPLKPMDAPTG£GLB3UCPDDVDV 
GSSVSNSDDKXSSSDSADSCDSLNPPTTASPTPTSILKRQKQLR 
RKNVRPIX}VTVyYFARRQGFTSVPSCGGSSLC3ttjQRHNSVRSyr 
LCgPAQBQBVNHRBIliREHbKEBKIfflRgKMKbTKHGXVESVKM) 
GLTLDDVSMDIDVENVEVDDYFF1X3PI*PT3®RHAI»LRASGVHR 
IDAEBKQELRAIRLSRETOGCDCRIiYCDPBACACSQAGnCQQVD 
RM5FP0GCSR0G0GHMAGR1 RFHP tRXnRlHYLHTIMKLSLBSKR 
Q\GAAQQPQ\*GALPDOQLQPDRSTGL*DPSWIGSHGX*SFTGKB 
AAATHLI ILRVXENRGABGKRK 



CLYKATQYKVtSPRKDaPDACYNPSS PAATTV FHIRTGLLLGDT 
Sia ITRTBBKBI PRQlTLRPnACAAIKSKKljRIGOGSl^ fc BRS * 
RVKNKYVCHESGVCXNt!AYWPCUI * AT*KKNKNDSVYTyQKGKAN 



RiT^vFnifTi y An a * u*oJD 



UKGHVHKCSPXPVPQTPYBBLHIiPAPELIJCKTKKIjFIXSI^ABKV 

IPLLMGTSCYVRGaTTIGIHCHPHBA*ELVPTDPAPDIIPI*KAB 
ASHP^VLKTSIIRQYCIARBGKDFTIPVGKPHCIGQKLYBSTTK 
TIT* *MHHTEXHPFSlCFSKLKTA*AHABfiH*DMTVPSGLY*IC 
RHRAYTOIiPNXHADSCVIGTIKPSPPIiPIKMGSLIfiPSVYASH 

KKKGIVIGWWKDHEWPRERIIQYYGPAIWQOGSWGYR/TP/VY

MLNVI IRl^QAlI£IISNBTCTALTVLAWQBT(P i roHArYCBJMAI* 

DYLLVABCX^OUCPNLTOCCXQIND<X5^^ 

IQVWHKPDPBSLPGKWPPAIOTTXTLTWSVIiLVIRTCLLLPCVL 

PLLFQWIKGrVATl>VHQKTSAHVKTMHHYBSISQRnSK£EDSSB 

BSH 






EKGKVKDWlAEGHl3bSLSCWJSV<BTQ?SNULHSLTPPTSVRRM 

PUTTVTLLKMVWOTHKKIiLCSKAFSTQLQQKIPIiHSQIIGIHHD 
SVCHKLXPNTSHX ISII^GQEWUjVQLKTIjAPLTI I IQKPQTQD 

HMKFHXNLFLHSBHLTPSVPQTVIPKKTGSPRIXLKITKTIQWG

RELPBSSLOGDUUIRVQASB\Q*NQSIJBSRKKKRKKSHKEDSSa 

SKERKSHKIPKLEP r SKQNHPNKRVI7rVSKKPEEKP r VLKEGSPSS 

AHTIPCSNNGSVHffVPKPQVGfiDLVWSkyGmiWPCMVSSDPQL 

BVHTKIBTIH1ARBYBVQPFSHQPBRAWVHBKRVREYXI2IKQYBB 

LLABATmjASNHSBKQKIRHPRPQRBRAQWDIOlAUASBALKMT 

fiKBRIBQYTFIYIDKQPBBALSQAKKSVASXTBVKKTRRPRSVL 

RTOPBQTOAGBVASSLSSTBI RHHSQRRHTSABBKBPPPVKIAM

HTAARKSIjPASITMHKOSIJJLOKCHMS PVVKIBQVPAUJNATG 

DGKFIDQPVYSTKGIGNHTHISVROQDEILIISTPKQRHKKPTQS

VSSPBATSGSTGSVKKKQQRRSIRlllSKSKICSTBVVPKKKIKKB 

QVGPLHVBS 



MMTABfiRKATGLSPQAACBXDGlVaVKVEREDEBDHWHGQDSTL 
QDTPPPDPBIFRQRPRRPCTQllTPGPfiKALSRLKBLaiOWLRPR 
IHTKBQI l£bLVLBQPLSILP KELQWIUGBYRFDSGEBAVTLLB 
DLBLDL3GQQVPGCfV HS P B MLARGMVPljJ>PVQSSS3PDmHgAT 

Oshfkhssrkprzjosralpaahipapp&bgsphdoamasalft 

ADSQAK\nCI3DMAVSIJCKjBBMGOQHLARRN£*Std3Z7RQQnCfiAP 
PQGGEMRHHNKE5TSKABTSBIX3ASRGBTTGRS QKKF GKKRDQR 
GrTGKHQQXNPKEKTRKEKRIJSGPAIGKDXXTITaBRGPREKDX 
GLGRSPSbSSNyiTPKgVPTGTKSHRCDEOGKCFT R SSB U R H K 
X rHTGEKPYECSBCGKAP\SLNS \NLVLEORI\HTGKKPHECKB 
OCKAPSHSSNLIIBQRIHSGBKPYBCNECGXAFSOSSD\LTKHQ 
RnrTGBKPYKCSEOGKAPNRKSYIiILHRRVHTRKKPYKCTKCEK 
\AFTRSSTtiTLHHRIHARBRASEYSPASLDAPGAFLKSCV 



tAtTCBPPAPPVYQAPCQSCPBPPGAEIBP 
SDSPHHTPVHPPPEBSAACPAPATCCPP PRSSHS 



KHF3KPGSQALYQLKRPASGGWSISVMPAQKITKPAAKTGI PLA 
Y KKXGDKKL HBKKPl^KHKQAHQTPBKRVNTCKRRRKISBKAAR 
KHRLBFTEKRKKQKDQI 1 SLMKAEQMKKQHKERLEHXKRAREQG 
WENVLSAGGSGBVKAPPLGSQGTIAPSS PSSRGQYEHYHAI FDQ 
MQQQHAEDNSAX?naffiIYGR^BHQXIX)lAVSfiAKaVEEPr^R 
KHBAKQNKARABQ0^GIIj©IIAAMYG®IJ?S33SGGKPRNKEKBV 
YLARLRQIRIjQHPNBRQQIKAKLRGBKKBAHBSBGQBGSBBADM 
RRKK\ IBS LKTUIAKARAAVLKBOLKRKRJSA YKRK g KWBKH LV 
AKGVrSSDVSPPLGQHKTGGSPSKQCK<RSVISVTSALXEVGVOS 
SLTDfggTS K BWQ KTOMA ISSKRBI LRRmBWLKAQKUgKGgQW 
LSDrTFBINVHEDAKKHBKBKSVSSDRKKWKAGGQLVIPLDBLTL 
OTSreTTKianWSBVIHJGPSGSPHBAHGKSPTDSVLKILGKAB 
LQLQTBbLKNTTIRSKI SPEGZKY KPblTGKKKVQCISHSINPS
AIVUSFVBTKSPBFSRASPOMSliKLBCailJSBPDDLBTBrLQBPS 
GTBntDB\Si*PCTrnJVHISKBKBTKBTQSAl?RITIQHKBVSEDG 

VSSTTOQLSDIHIXPOTNDSQHSKCDVDKSVQPBPFFHKVVBSB

HUHjVPGVQSVQCSPBKSPAFRSHSHIjP P KHXKKHSIaLIGIjSTG 
LFZlANMPHNliitTCSliPDLSKIiFBTLMCVFlVGDVRQDBLBIDBI 
EDBHlKBGPSDSBDZVFESIDTDIOBLOASMEQLLRBQPGEByS 
pro^<iUf>inffawKUTAMC;TntfAngPIMPSSgaAlAiKBW3inBP 
CTTA < fP f> pr , pgmaiUT.irRT‘Pr^IJaQTO«PRKPPRWRgIKAIH8 
DKDKNIBICSKrVCKTIGNEHUHLYAKIIjHLVMADGAyQBDNDB 
KilFSKFGSQALYQLKRPABGQHSISVHPAQKITKPAAKyGIPLA

YKKYGDBCKLHKXKPLQKHKQAHQTPBKRVOTGBERRKISKBAAS

KHRIiBPIKKEKKQKDQI ISIKKAEQMKRQKXKaLHRrKRARBQG 
HHMVL3AG0SGBVKWP gUGSGGTlAPSSFSSBGQYBHYHAr POQ 
MQ0QRABnNK7UO7KRB I YGRGLPERQKBQIAVKRAKQVKBFLQR 
BCRKAHQHKARABGHMGHjQMLAAMTGGRPSSSRGGKPRHKEHEV 
IbARIiRQIRIjQNFHKRjQQIKAKLRGSKKKRNHSEGGKGSKKADM 
8RRK\ 18SLKAHANAHAAVLKE0I£SKRKBAXSRBKKVW EEHLV 
ARGVKSSEWPPbGQHBTGGSPSKQQWRSVISVTaALKBVGVns 
StTDTRBTSBEMQICtSKAXSSKRBIIiRHLNBHIdCAQBbBXGKQN 
LSDTFBXMVBBDAKEBBKEXSVSSDRKXHEAGGQIiVIPIiDBlilIt 
PTSFSTTBRKTVGgVIKLGPHGSPRRAWGXSPTDSVLICII/jBAB 
LQLQTKLLSHTT1RSBISPBGKKYKPLITGBKKVQCISHBIBJPS 
AXVDSPVBIXS PKPSKASPQMSIiKLBGHLEEPDDLBTEILQB PS 
GTNKDK\SLPCTITDVWISKBKBTKETQSAI>RITIQ^BVSBDG 
VSSTVDQLSDXHX BPCTNDSQHSKCD VDKSVQPE PPPHKWH9B 
S HLNLVPO^QSVQCSPBESPAHlSHSHLPPKNKNKHSIiLIGIiSTG 
j IiPDANHPKMUtrC^LPDIj6XbFRTXjMDVPTVGDVRQDNLBIX>BI 
BDENHCEGPSBS ED rVFBBTDTDLQELQASHEQLIiREQPG BBY S 
[ EEBBSVLKNSDVBPTANGTDV7\DEDDNPSSBSAbHEEWHSDNS D 
GBIASBCBCDSVraHLEBLRIiHljEQBMGFBKPPEVYEKIKAIHE 
DSDBKIBICSKI V®niX3^BHQHLYAKimi»VMADGArQEDNDB 


-i92S 

\ 


3 


1558 


bDPSMr^5^AyVAILIjJ?YVSRASC0DTFTAAVYEHAAXI»PHAT “ 

LTPVSREEAIALMHKHU)1Ij£GA1TSAABQGAH1 ivtpbdaiyg 

RHFNKD5LYPYUSDIPDPBVBI9Z PCNHRHRFGQTPVQSBLSCL\ 

AKNHSIYWANICTKKPCDTSDPQCPPDGRYQYirrDWF\DSQG 

KU/ftKXHKQNLPKGHHQFHVPKBPBIVrnirTTOSPGlFrCFDI 

MBDPAVTLVKDFHVDTIVFPTAWWnrbPHIjSAVBFHSAHAMGM 

RVHFlASKIHYPSKXMTGSGITAP»3SRAPHYDHK.TERGKLtLS 

QbDSHPSHSATVNWTSYASS rBALSSGHKBPfQ3TVFFDKFTFVK 

LTEA/AGNYTVCQKDMX3ILSYKMSQIIPHBVSWX3AFDGLHTVB 

GRYYLOICTLI*KCICrn?U?TCGI>SABTASTRFa i 4PSbSOTPGTO 

YVFPKVLLSE?*QLAPGBFQVSTDGftLP3LKPTSGFVLTVTLPGR 

LYKKDHASNAS8G^?A0ARIIMLIVIAPIVCSLSW 
RQfCFHIVPPTNROKTGIjBDPKBYLiPVDIlAVIYNPATOAIWTAK
KLVWXPS KRKGFEAAS I KKKRGDKVMVBIABNGKKAMVHICDDI Q 
XMMPPICPSKVBDHABLTCUfBASyLHHLXDBYyBGLIYIYBGLP 
| CWIHPYKHLPIYSEHXIBI'WRGKKHHEMPPHIYAISKSAYRCM 
LQDRKDQS IbCTCBSGAGKEBHTkKVIQYIiAHVASSHKGRKDHIf 
lPGB\LBBQLbQANPILB3FGH7kHTVCSJDNSSRPOKPIRIHFDV 
TGYTVGANl ETYliLEKSRAVRQAXDKRTFHI PYOLLSG\AGBHL 
K3DLI*L£GFHNYRFl*SNGYXPlPGQ\aDRGNPRGDPGKAMHi;P« 
FSttBB3DbSMI»KVVSSVIiQFGNISFKKBHNTD0ASMPBl)rrVAQKL 
CHIjUBWVMKFTRAIbTPRIKVGIUJYVQKAQTKEOADFAVSALA 
KA3YKRLPRHLVHRIHKAIJHmCRQGASPIGnJ)IAGP3IFBLK 
SFEQIiCXNYTNEKl>QQX*FNHl>!PILEQKBYQREGIBV7N?IDFGL 

DLQPCIDLIKRPANPPGVLALLDBBCHPPKATDKTPVBKLVQBQ

GSHSKPQKPRQLKDKADFCXIHYAGKWIjyKADBHliMKKMDPIiND 
NVATbiHQSSDRPVAKWKIWDRXVGLDGVTGMTBTAFGSAYKr 
KKGKFRTVGQIiYKESLTKIi^TLRHTNPN FVRCI I FNHRXRAGK 
LDPHLVUXJIiaCNGVUgGI RICSQGIWS rVTQEPRQH YBXLTP 
HAIPKQPMDGKQACBHM
LHKKROLKI TDI 1 1 FFQAVattJYLARKAFAKKQQQLSALKVIjQR 
HQAYLECLKHKQWWHVFTKVKPLMIVTRQREBLQAKDBBiiLKVK 
SKQTSVBGEIiKEKERKHQQIiIiKEKHIIiABQIiOTVETKLFAEABBM 
IQtfUiftAKKOKTtBBTLHI)T«fSKVBKBBBRMQILOHHKKKMQRH10 
DLEBQLDHBBGARQKLQIiBKV73tt£AKXKKNSI£IIaLLBDQNSKP 
IKBKKLMHDRlAKCSSQIABBKEKAKNLaKXHNKOHVMISDLEE 
RLOXBKIHQSL 8 KAKBIXOGBTrDLCE^XXSIQAQIDBLKI<}I» 
AKffRKKrjDGRLARGDDBTLHKHMALXVVRBIiQAQIflELQSDPES 
BKASRNKAKKQKRDLSHKLEAI*KTRLBDTM>TTAAQQSIiRTKRB 
QHVAKIJtKRLHBHTKMHHAQXQDKRQRHATAIiBBLSBQLEOAKR 
FKANLHKNKQ0IjBTIINKKIJtf3VKVL<X3VKAKSBHKRKXIJ>AQV 
QBLHRKVOTGDRLRVBIASKASHIXJHKLDNVSTLIiEBABKKGIK 
RlKDAASLBSQIiQDTQKLIjQSETRaKIjJNLSSaiRQIiERFKKSLQ . 
BQQKBKSHARKHIiEKQVLALQSQIAiyrKKKVDDDlJGTIKSUKKA 
10CKIiUa3AEALSQSL£BKMAYDELEXTKNRLQQEZJ^LTVDLO 
HQRQVASNLBHKQ\KKFDQI*IABKKSISARTAEERI51AEAKARB 
KBTKALSLARALBBALEAKBBFBRffiSKQUtADNEDIMSSKDDfVO 
KNVHZUiKKSKRALBQQV\BB>lHTQLEBLKI)HLQATBDAKliRItSV 
NMQAMK^FHJU)LQ^RDEQHBBKKRL1>IK0VREI*HRKLBDBRKQ 
RALAVASKKKMBXIJLKDLRAQIBAAHKARDBVIKQIjRKLOAQMK 
D^RBIiBBRRASRDBIKVQSKBSBKKW3I£REIljQLQKBLftSS 
BRAREHAEQKRDKLADBITNfiASGKEAmJEKRRLSARXnQLES 
ELKBBQSNMELLIflHFBKTTLQVDTUSlABLAABRSAftQKSDNAR 
QQLERQNKBLKRJCLiQKLEQAVKSKFKA'rrSALEAKIGQXjHEQLB 
QRAKBEAAANKLVRATKKKIdCSI FMQVKDERRHADQYKECPIBKA. 
NARMKQLKRQ1*KEAEEB7\TRAHASRRKI»QRBLDI)ATEANEG1*SR 
KVSTLKNRUtRGGP ISFSSSRSGRRQLHLEGASI>EI^DDDTKS K 
TSDVNHTQPPQSB 



] RGNCFW IVPFTMAQRTGLBDPBRXLFVl>RAVirKPATOADWTAK. 
i KLVWIPSERHGFHAASIKBKEtGDBVMVELAEMGKKWIVNKDDIQ 
KMMPPKPSKVBDMAHLTCUffiASVmiIja>RYYSGI»lyTYSGLF 
OArD^yKNLPrySENXIBMY 3 W 3 KKRH®lPPHIYAJ 9 S 3 AYRCI 1 
LQDREDQS IIiCKJBGGAGKTKNTKKVI QYI*AHVA3SHKORKDHN 
XPGB\LRRQlJjQflypniESPGHARTVQHDl?SSRPGKF£RlNFPV' 
TGYWGAHIETYIiLRK3RAVEQAKDKRTFHIFYQIJ>SG\iU3BHL. 
K3DU JjBQPHWYRFLSHGYIPI PGQ\QDKGWPRGDPGRAMHIMG 
PSHKKII*SMI*KVVSSVlX2EXjir[SFKKERC/TDQASMPSNTVAQKL 
CHLLGKNVI® FTRAI LTPRXKVGRDYVQIAQTKKOADFAVBAIA 
KATYBRLFRffI*VHHJHKALDRTICRQaASFIGILDXRGPBIPKIJ? 
SPEQLCIHYTNEK3>QQLFNHTHPI1J3QBBYQRHGIBWHFXDPG1» 
DliQPdDL iKRPANPPGVLALIrDKSCWFPKArDKTFVEKLVQBQ 
GSRS KPQKPRQIiKDK^PClIHXAGKVDTKMSPilljPlBCHMDPriND 
NVATr»LtfillSSDRPV7iKUfKI3VDRXVraLDQVTGJfl^APGSAyKr 
KH X ^y RTVGQLYKESLTKI^C^TLRHl’WPKFVRCl I PNHEKKAGK 
LOPHI.\rU3QIiRCSGVlEGrRICRQGFPRRIV70BPRQRyBrLTP 
KAIPKGFMDGKQACBSKIRAXiBLDPNLYRlGOSKI PPHAGV1AH 
LBKERDLRITDI IIPPQAVCRGYIARKAFAKXQQQLSAIjKVbQH 
NCAAYIJCliRHVfQWTOVFTKVKPLLQVTRQBBKLQAKDBBLLKVK 
BKOrncVBQBLKEHSRKHQQXiLRKKHl LABQ WftKTKLPAEAEBM 
RARLAAKKQELEBlXiRDLSSRVBSBBERNOIbQHEKKKMQAHIO 
DLBBQIiDKBEGARQKLQIiKKVTABAKIKKHBKKrtiLHDQNSKP 
IKBXKLMKDH IAECSSQUVEKEBKAKNIAKIRHRDBVMISDLEB 
RLEOCBEKTRQBLBKWCRKIJJGHTTDIjQDQIABIjQAQIIJELKLQI. 
AKKKEBIiQGALAftGDIlBTLHKNKALKVVREL^OIABIiQKDFES 
RKASRNKAEKQKRDLSBELRMaKTELKDTLDTTAAQQELRTKRB 
QBVABX>KKftLEEBTKNHEAQIQX34BQRHM3&EELS£QLEQhKR 
PKWnJ5KNKQGLBTIR?KBIiACKVKVIjQQVK7^EHKRKKIiDAQV 
QKUlAKVSBQDRLHVEIABKASKIXJHBIiENVSTlALBKARKroiK 
PAKDAASLKSQLQDTQRLLQKBTRQKL^nUSSRIRQLKSKKNSLQ 
gi ^H Fp1?RAPlon,Kfq}^-^ T <y^^t>TTCKKVnDDIirrTKST.KFtft 








HQRQVASKLBKKQ\ RKFDQLLAEBK3 ISAR YAKERDHAEABARE 
KBTKAI^IARALERALBAKBBPKRQNKQLRADWEDLMSS KDDVG 
HHVHELEKS KRALBQQV\EEMRTQ LKKLEDBLQATKDAKURLEV 
MMQRKKAOFBRDLQTHDBQHBEIDCRIiLIgOVttEIgAKLEDBRKO 
RMAVASKPOHSIDLKDIJAOIBAAHKABMVIKOLttKLQAQMK 
PTQRKUggABASM)BlPAQSKESBKKIJCaT.BaBTT4)L0B3LASS 
BSARRHABQERDSLADBITBSASGKSALLDBXRRIiBARZAQIiBB 
RTtPHBQSHMBLI/niHFRKTTLOyDTLliAKLAABRSAAOKSDNAfi 
QQXiBRQHKELKAKLOSLEOAVgSKPHATlSALBMCtGOGEBQLB 
QEAKBRAAAHKLVRRTKKKLKEI FKQVKDBRKHADOT1CEQMBKA 
KARMKCLKRQLBEAKKKATRANASRRKLQSBLUIiATKANBGLSH 
BVSTLKNRJjRRGGPISFSSSRSGRRQLHLBGASLKLSDDirrKSK 
TSDVNKTQPPQ3B 


5932 


33 


S72 


"RHLBEICpEFEQWSRKLKLSGPHWKKGKPRGTGGUIVKABANKtl 
FGATLAVGLTIFVLSWTII ICOTCSCCCLYKTCKRPHPV\APP 
FWPP/PWHAPYPQPPSVPP5YPGPSY00YHTMPFQPQfPAAPY 
PMQTPPPYPAQPMGPPAYHKTLAGGAAAPYPASQPPTfflPAYMim 
PKAAL 


5933 



1 


3190 


GTRKLK>1ADKTPGGSQKASSlcrRS9BVHSSGSSnAHMnASGPSD 

SDMPSRTRPKSPRKHNYRNESLARBSLQJSPHGNLSRPLLEHKLK

APSIGKMSTAKRTLSKKEQEKLKKKKDBKAAABrsfKKFLAAFBQ 

EDGHKVKTFVRGGWNAAKBBHBTOEKRGKIYXPSSREADQKNP 

PHQSSBERPPSLLVIBTKKPPLKEGBKBKKKSNLSbFKEELKQI 






QBBBDBRHKTBGRLSRFBPPQSDSDGORRSMnAPSRKNRSSGVIj 
0DYAPGSHDVGDPSTT\HFYLGN I \HPQMHLKKCCOQBFGRFGP 
IASVKl>MPRTOEKRARKMfCX5FVAFMNRRnAERAliKNIiB5KMX 
MSFEMKbGHKGKAVPIPPHPIYXPPSMMEHTIjPPPPSGLPFNAQP 

RKRLKNPNAPKLPPPKNKEDFBKTLSQAIVKVVIPTERIILLALI

HRMIEFVVRBGPMFKAMIMNRBINNPMPRFLFBWQrPAHVYYRW 








KLYSILQGDBPTKHRTBDFRMFKNGfiFWRPPPLNPYLHGMSBBQ 
BI^PVBBPSEEGALKBKQRDKLKEILRGLTP&KNDlGnAMVFC 
LHNARAASEIVDCITBS LS I LKTPLPIOaARLYLVBDVLYKSSA 






KVAHASYYKKFFKTKLCQI F$ DLNATYRTI QGHLQSKNFKQRVM 

tcfrahkohaiypbpfi»iklqmifixjlvntikbkktkdvpddld 

GAPIEEKLDGAPLRDVDGI prDATPIEOrjXJVPIKSLDDDLDGV 
PLDATSDSKKNBP I FKVAPSKNBAVDBSELBAQAVTT5KHELFD 
QHKESKKBBMQNQKEBSBDKKirrQSStSRHHHLYSHPIKKEMTB 
SKFSKYSEMSKKKRAKLRBIBLKVMKFQDELESGKRPKKPGQSP 
QEaVBHYRDKLLQaEKEKELERERBRDKKDKEKLE3RSKDKKKK 
D3CTPTRKERKRRHSTSPSPSRSSSGBRVKSPSPKSKRSBRSKR 
SHKESSRSRSSHKDSPRDVSKKAKRSPSGSRTPraSRRSIlSRSP 
KKSGKKSI^QSI^PHRSHKKSW3KTCiTQiKPFICKAVTYWKC3>LP 
LCPBRSVF 




3190 


GTRKLKMADKTPGGSQKASS ktrssdvhssgssdahmdasgpsd 

SDMPSRTRPKSPRKHNYRNBSARESLCDS PSQNLSRPLiLEHKLK 
APSIGKMSTAKHTLSKKBQKKLKKKBDBKAAAE1YBBFLAAFKG 
SDGNKVKTPVRGGWNAAKBEHBlDBiCRGKIYKPSSRFABQKNP 
PHQSSNBRPPSLLVIBTKKPPIiKKGBKBBaCKSNIiKLFKEKLKQX 
QSERDBRHKTKGRLSRFBPPQSPSPGqRRSMDAPSRRNRSSGVIf 






DDYAPGSHDVCTPSTT\MFYUH«l\NPQMNLKKCCOQBPGRFGP 






IASVKIMWPRTBBKRAREEHOGFVAPHNRRDAHHALKNLNGKMI
MS PH* KLGWGKAVP I PPHP IYI PPSMMBHTLPP PPSGLPFNAQP 
RKRLKNPHAPMLPPPKNKKDFKKTLSOAIVKVVI PTERNUiALI 
HRMXBFWHEGPMPBAMIMNBBIffliPMFRFLFBHQTPAHVrYHW 
KLYSnXy^PTKMHTKDFRMFKNQSFffRPPPlNPTLHGHSEBQ 
BTEAFVBBPSKKGALKBeaRDKLEBUiRGLTpRKNDIGnAMVFC 
LHMABAAEBrVDCITBSLSI LKTPLPKBCIAfiLYLVSDVLYNSSA 

KVANASYYRXFFBTKLCQn^DLNATYRTICXSaLQSBHFKQRVM 
TCFRAWBDWAI YPB PFL IKLQN I FI/3LVHI I SEKBTBDVPTJDLD 
GAPIBBBLDGAPLBDVDGIP I DATPXDDLDGVPXKSXiDDDIiDGV 
!>IjOAraX)SKKNHPIPKVAPSKWSftVr)KSKf.RA 0 AVTTSK>#SLFD
QHEESKEEKMQNQKEBSSDHEDTQSSKSKBHHL.YSHPXKEEMTB 
£KP6KYSBKSBBKRMCLRB1BI»KVMKPQDBI£SGKRPKKPGQSP 
QgQVBHYRDKLLQRBKBKBLHRKRE8DKHDKKKliBSRSKDKKEK 
DECTPTRXBRKHRHSTSPSPSHSSSGHRVKSPSPKSKRSBRSKR 
SHKBS5RSRSSHKDSPRDV3KKAKRSPSGSHTPKRSRRSRSRSP 
KKSG KKSRSQSRSgHRSHKKSBGOTSnGMCPFBnOWTYWMCDLP 
LCPEHSVP 



SYMIiSGWRI*SHPKlQPHJ^HHGI®FGTHAPVHQDDCB IGASAL 

SDSGSiVSSBARBBEKSKBSRQEALBRIJXAXAOERYKYSVBDF 

TGVYBBVDKBQ YSKLVQARQD0D5n VHDDG IGYVEDGRB I FEDD 

Tjmn&TJia nynfKimGKARllKDggHVXfn^^ 

GOCTADlWVDLSKDGLLGDIbQDlJirrBTPQITPPPVMIIJCKKRS 

IGASPNPFSVHTATRVPSGKIASPVSRKKPPLTPVPljKEAKPZiG 

D0VQVESTKEEQSSGAMHPKDGDPDHIWHVHBVDLRPKAAKAHD 

KESKPAHRVKQSADSGKGTVSYLGSFLPDVSCIJDIDQBGDSSFS

VQBVQVDSSHIjFI*VKQADBBQVFHFYWU1AYBIX>YNQPGVVFIiF 
GKVWIBSABTaVSOCVMVKNIBBTliYFLPHEMlCIDLMTGZBTGT 
FISKKDVYBEFDSKIATKYKIMKFKSKPVHKNYAFBXPDVPBKS
BYLEVKYSA£MPQI*PQDLXBSTFSU V FC5THTSSI»KLFIjMHRKXK 
gpcwi*bvkkstaujqpvshckvkamalkpdi»vn V 1KUVSPPPLV 
VMAF3MKTMQNAKNHQNBI IAMAALVHHSFALDKAAPKPPFQSH 

FCWSKPKDCIFPYAFKKVXBXKMVKVKVAATKRTLLGPFLAKV

HKI DPDIIVOHNITCF^LKVLLQRIHVCKAPHWS KIGRLKRSNH 
PTOJGGRSGFGBRNATCGRMICDVBISAKEI*IRCKSYHI*SKLVQQ 
LLKTKHWIPKEHI®JHYSKSSQIiLYLLiEHTHKDA\KF 1LQIMC 
BMIVIJ»UlLQITEaiyGNIWSRTI«3aS3ERNBFXajiHAFYBI^ 

IVPDKQIFRKPQQKLGDBDKBIDGDTNKYKKGBKKGAYAGGLVL

DPKVGFYDKFIUjLDENSLYPSI iqefnicfttvqrvasraqkv 
TBDGEQSQIPKIiPDPSIiKMGI LPRKIRKLVKRRKQVKQLHKQQD 
l^PDLXt^YDIHQKALKLTANSMYGCLGFSYSRFYTVKPlJlALVT 
YTOIRKIIJ^IJTKBWVQiGMICLKVXYGDTDSIMINTNSTNIiEKVPKIj 
QJKVTCSBVNKLYXXiBIDIMfVFKSLIjiJjKKKKYAAIiVVHPTSD 
(3JYVTKQKUCGI^IVRIU)1fCDLAKDTGHFVIGQIIiSDQSRDTlV 
ENIQKRLIBIGENVIiNeSVFVSQFHISKALTKDPQDYPDKKSLP 
HVBVAKMXK5QGGRKVKnGmTVSYVlCQPGSNI>TASQRAYAPBQ 
LQKQIXSLTIIxrQYYI»RQQIHPVV3UlICBPlDGIDAVI>XATGWEL 
VDPTQFKVHHYHKDKKHDALUSEPAQIiTDKBECYMXZBRFKCK^ 
TOGTKMI YCHJVPDC3SGTDMBPSUffiCSIIXDCKASPLTCTV£}LSN 
KLIMDIKRPIKKYYDGWLICKBPTCXHHTRHLPLQPSRTCPLCP 
ACHKATLQPEYSDKSI*YTQLCFYHYIFDABCM*BKLTTDHBKDK 
LKKQPPTPKVI^YRKIiKN’IABQPLSRSGYSBVMLS KLP7VGCAV 
K3 

p ngW QvnRRRPPPArr^FriKRIiQKPSRLJiRAVHRSRAHTCYLAI 

RHU'lATCCPSPTTBlCrFGPIIQRAPPlJUjliVQKRBADSSGIiAFAS 

HSLQRRKICGLIJ^SPVAPLRTRPPLLlSLPQDPSQVSSVXiyVDLl* 

PETHRHVRtHKM£3Sl>RPbGPTIMQ<SVRVAPQG\LBRVPGIPI 
SRl*VRGGI»ABSTGtlAVfiDBII*BVBSIBV!RDK3!IlfQfVTDMMVAK 
SHNXLXVTVKPftNQHHNVVRGASGRLTaPPSAGPGPAEPDSDDD 
SSDLVXKNRQPPSSEKJLSQGP PCTDLH PGCRHPGTRSSLPSLW) 
QKQASSGWGSRTRGDGSGPSL 



PTSI*LKSTVQ1WC3RI*IjQDKRYQCVYSIJVBI 
YGLTSSYSLW5IMLRSSLKQYS PKALRBKSMYSDI PDVKNDFAPI 
LHLRDQYDPI>YSKRPSIPLS8VSSNKI*K0IHLSNBWTVEK1iKSK 
LVKNRCPKXBIifliFNIjBGMDNVPBLTBMBVLSLEI.1 PBVKLPS 
AVSQLVHI»KBLRVYHS9LVVDHPALRPI*KEHLKIIilLKFTQCK 
IPK>fVFHLKHI*KKbYI»SGCVIiPBQLSTMQliKG PQDIiKNLRTIiYL 
KSSLSRI PQVVTDLIiPSIOTLSLDHKGSKljVVIiJNLKKMVNXiCS 
USLISCDLERIPHSI PSUOnJffiLDLRKNMLJCIVKBI ISPQHLQ 
NLSCLKLHHNUIAYI PAQIGTO^NLEQLSLDaNNIENLPLQLFI* 

CTKLHYLDLSYNHLTPIPKBIQYI*\SNLiQYPAVTNNNI 8MLFDG 








LFQCKKIiQCr«IjT.fiKWSLMNLS PHVGBLSHLTHREP IG \NYLETb 
PPBLEGCQSIiKRNC2jIVEENIJ2JTLPIJ*VTIDU£TCIiDKC 


5938 


395 


1865 


YKGEGFFC2JQEAHGEfiHKKKKAMSSPHIWSTGSSVYSTPVFSQK 
HTVWI LLLLfiLITCFTSQKSDDDYBDXASHKTHVLTPKVPEGDV 
TVXLNHLLEGYDNKLRPDIGVKPTIjIHTDHYVHSIGPVHAIHMB 
YTIDIFFAQTHIDHKIiKFHSTlKVIiRIMSIDfVTHIHIPDTPFHH 
SKIO^IH^HHrTTPNSMlJirNNDGSVIiysijHLTI DAECQUQX^NPP 
KDHHSCPLRFSSYGYPHSEIVYQffJCaSSVEVGDTESKRI.YQFSP 
VGLRHTTBVVKTTSGPTVVKSVYPDLSRRMST FTIQCT IPCTU 
WLSMVSPWINKDRVPARTSIjGITTVI>TKm>STXAHKSLPKV3 
YVTRHDIiPVSVCPIPVPfiALVBYG\TTjHYPVSHRKPSKDKDKKK 
KHPAPT1DIRPR3AT1QMHNATHLQBRDBHY0YECLDGKDCASP 
PCCPBTCSTGAWRIJGHIHIRIABWDSYARIPFPTAPCI*PHI»VYW 
VSYLYL 


5939 


66 


1404 


IRPGYIiKBVQKNSPGHRAGLBPPPDPlVS IWGSRI>WDHimJCD“ 
LLKANVEKPVKMLXYSSKTLKLKETSVTPSNLIJGGQGLLGVSrR 
PCS FDGANKNVHHVLHVKSHS PAALAQLRPHSDYI IGADTVMNB 
SBDl#PSLIBTHBAKPI»KLYVYMTDTDNCRByi ITPHSAWGGBOS 
IX3CGIGYGYLHRI PTRPPEEaKKISJLPG^IAGTPn’PIjKDGPTE 
VQI*S8VHPPSLSPPGTTGIEQSLTGI*3XS3TP\PAV3SVLSTGV 
PTVP\LLPPQVHQSLTSVPPMBSSYl^LiPGLHPPTRQGIiPHLPQ 
PSTPHLPRNPTHSWPGVGIjTQBFVKPGVIiPPLSSMPPRNIaPG\I 
APItPXiPSKFIiPS PPliVPBSSSAASSGBLLSSLP PTSNAPSDPAT 
TTAKAHAAS KI>TVDVTP PTAKAPTTVEDRVGDSTPVSHKPVSAA 
VDAHASBSP 


5940 



145 



717 


RRSASRSASPRQSAGTAVTTOTOAOGTCIiARnHHIUiRWRADGRS 
LBKLPVHMGI>VTTBVEQBP3PSDIASLWHCPtAVGISYISVYDH 
QGIFKRHHSRLMDBHjKQQQBIiLGU>CSKISPBFAH5NDKDIX3V 
I2?CHLAVKVI*S PBDGKAD XVRAAQDPCQLV7VQRQKRPTDLDVDT 
LA\VYLVQMWI»II*I 


5941


i 15 


6147 


>KMR«GASSPfiSPBPVOPPAP^PFCCGGSLIAVVVLlJtfiPVA“ 

WGQCNA?BW\LPFARPTHLTDKFBFPIGTYLNYBCRPGYSGRPP 

SIICLKNSVWTCSAKDRCRRXSCRNPPDPVHaWHVIKaiQFOSQ 

IKYSCTKDYRM03SSATCI ISGDTVIHDHBTPICaJRIPOGLPP 

TITHGDFISTNRKNFHYGSVVTYHOTPGSGCTKVFKLVCTPSIY 

CTOTDDQVOIWSGPAPCCIIPNKCTPPHVBHGILVSnHRSLFSb 

HBWRPRCQPGFVMJOGPRR VKC3QALNKWBMIiPSCS RVOQPPPD 

VU1AEBTQKDKDNF3 PGQBVFYSCBPGYDUBGAASHRCSrPQGDH 

SPAAPTCBVKSCDDFMGQMiHGKVLFP'yNLQIGAKVDFVCDBGF 

QUOSSSASYOrLA^ESJbWNSSVPVCBQIPCPePPVIPHGRHTG 

KPLBVFPPGKAVNYTCDPHPDSGTSPDLIGESTrRCTSDPQGNG 

VHSSPAPRCXSILGHCQAPDHFLFAKLKIQTHRSDFPIGTSIiKTE 

CRPEYYQRPFSITClJ3NLVW33PKDVCKRK5C3CrpPDPVNGMVH 

VITDI<]fVGSRINYSCTTGHRI*IGHSSAECILSQJAAHHSTKPPX 

CQRXPCGLPPTIANGDFISTHRSNFHYGSVVTYRCHPGSGGRKV 

FELVGB PS I YCTSHDDQVGIHSGPAPQCI rPHKCTPPNVBHGIL 

VSDHRSLFSLNBVVKFROQPGFVMKGPRRVKEQALHKWEPBDPS 

CSRVOQPPPDVLHAKRTQRDKDHPBPGQBVFYSCBPGYDlllGAA 

EHRCTPQGDWS PAAPTCEVXS C3>DFMGQLLKGRVLFPVNliQ]XA 

ECVDFVC3)BGFQI^3SASYCVLAGMEStLWHSSVPVCBgiFCPSP 

PVXPHGRHTGKPLKVFPFGKAVNYTCDPHPDRGTSFDL.IGBSTI 

RCTSDPQGNGVMSSPAPROGIbGHCQAPDHPliFTlKLKTQTHASD 

PPIGTSLKYECRPBYYGRPFSITCU)NI»VWSSPKDVCKRKSCKT 

PPDPVNGWVHVITDICJVGSRINYSCTTGHELICTSSAECILSGN 

TAHWSTKPPICQRI POGLPPTIAH(mFISTim3NFHYGSVVTYK 

CHLGSPGIUCVFBLVGBPSIYCTSHDDQVGIWSGPAPQCI IPNKC

TPPNVEHGILV3DNESLFSLNKVVBPRCQPGPVMKGPRRVKC0A 
MKWEPBbPSCSHVCQPPPBILHGBHTPSHODHFSPGQBVTYSC 
BPGYIILRGAA8LHCTPQGDWS PBAPRCAVB5CDDFLGQLPHGHV 
LFPLHL0DGAKVSFVCDBGFRI»KGSSVSHCVf.Vl3OlSLIiNHSVP 
VCKHTFCPNP PAILHraiHTGTPSGD I PYGKB I SYTCDPHPDRS4 
TFinjGKSTIRCISDPHGNGVWSSPAPRCELSVRAGHCKTVEQF
PFASPTIPINDFKFPVGTSLNYBC31PGYFGKMPSISCLENLVWS 
SVKDNCRRKSCGPPPKPFNCMVHIIHTDTOFGSTVNYSCSrEOFRL 
IGSPSTTCLVSGMNVTWDKKAPICBIISCBPPPTXSNGDPYSKN 
RTSPHHCTTVVrYQCHTQPDGBQLFKLVGBRSrrCTSKDDQVGVM 
SSPPPSCISTOKCTaPBVKHAXRVPGNRSFFSIiTBXIRPRCOPQ 
FVNVGSSTVQOQIlSGRfilGPKLPHCSRVOQPPPEI LHGSHTIjSHQ 
rtfFSPGQBVT?SCBPSYDl^GAASI^CTPQGDWSPKAPRC3rVKS 
QI^FbGQIiPHGRVIiLPLHLQLCAKVEPVCDKOPRIJCGHSASHCV 
LAGNKAU2KS5VPVCBQI FCPHPPATLNGRHTGTPLGDI PYGKB 
VSYTCDPHPDRGHTFHL1GBSTXRSTSBPHGHGVWSS PAPRCKL 
PVGAACPHPPEtOHGHYIGCSIVSIiYliPGHriSYTCDPGYLLVGK 
OPIFCTOQG nfSQLDHYCKKVNCS PPLFMNG I S KKLKMKKVYHY 
CTYVTLBCBPQYTLBGSPWSOOOAPDRWDPPLAKCTSBT H DflLl 
VGTLSGTIFFILIjIXFLSVriILKHRSnNKftHEdPKKVAIHLHSQ 
GGSSVHPRTUrniBSNSEVIiP 



YLYVRMRANPlAYGISHKAYQI DPPI* \ RKKREQ\I*VXB \VGRKL 
DK\AQ>5IRPKKRTGYPSSTOIiC3?TA£HYYXKYliTIBTPNKLFDA 
HKTHGDIPAIVSKAKKPDQIKVRREEXBKLimiLSNPCRLSTPG 
GVEKSYGKIH1 LUrrYINRGS>fl3S PS LISDSAYV7U3NAARIVHA 
LFBlAJjRKRWPTHTyRIiliHLSKTlXDKRWfGWlSPLRQFSIbPPH 
MLTWoKBKKLTVDKIiKEMRKDKIGHl UOIVNIGI,KVKQCVHQ I P 
SVHMSAPIQPITRTVLRVTLSIYADFTWNDQVHGTVGBPKWIWV 
BDPlHnHXfHSKYFIALKKQVI SKBAQLLVFTZ PIFEPLPSQYY 
I RAVSDRWDGASAVCI IKPQHUXPHRHP PHTKLLDUJPIiP ITA 
LGCKAYEALYHFSHPNPVQT0IPHTtYHTIX3JVlXGAPTGSGXr 
VAABLAI FRVFNKYPTSKRVYIAPliKAJLVRKRHDDWKVRl EHKL 
GKKVIELT6D\^PDMK51AE3U>LXVTTPEKHDGVSRSHQHSNYV 
QCfVTlLIIDBXHlkLGEBROPVLBVIVSRTNFlSSBTSKPVRIVG 
I LSTAIANMDLADWI^^ PLSVHIOGFPGQ 

I HYCPHM&SHNKPAFQAIRSHSPAKPVLI PVSSRRQTRX/TALKIil 
A?UVTEEDPKGMUSKDERBMEHI I ATVRDSNLKLTIaAFGIGNHH 
AGLHBRDRKTVHBI*PVNC[KVQVI*IATSTIAWGVNFPAHLVI I KG 
THYYDGKTORYVDFPITDVIjQMHGRAGRPQFDDQGKAVXLVHDI 
KKDFYKKFLYHPPPVKSSLLGVI>SDHLNABLAGOTIT3KQDAL»D 
YlTVTTYFFRRLlMHPSYYCIbGDVSHDSVHXFLSHb I EKSLI ELK 
LSYCIRIGraaNRSIBPLTrGRIASVYYIiKHQTVKMPKDHIiKPBC 
STHSLLSIIiSQABBTTDLPVRHKBDSHNSKLAXCXiPIBSNPBSF 
DSPHIBAaXiUX2AHLSRAHLPCPDYini>TKTVIiDQMJlVOCiAML 
DVAflNQGWLVT V LN 1T K L I QMVT QGRWLKD33LLTUPN I ENHHL 
HLPKKWJO>IMKGPHAIlGfiTSIBCLPELIHAOGGKDHVPSSWBS 
RLHRAKTKQAWNFI^HLPHIHVGrSVKGSWDDLVEGHNBLSVST 
LTADKRDDNKHXKLHADQBYVIjQVSLQRVHFGPHKGECPBSCAVT 
PRPPKBKDEGWFLIbGKVDKRKLIAUCRVGYIRNHHVASIiSFYT 
PBIPCTyiYTLYFMSDCYLGLDQQYD/MLSQRYTSBSPCTGQHCl 






DKPTRHKTn*SSSMiUmAAABGPVGIK3EIJI 
FXSXjQJQSPTKAEKPASSSLPSSPPPQIiL'TRKVVFGbGGKLPLHD 
gkdssflwrlrgpsggg\kepalsqyqrllcihpplpbiyqvl 

LS PTQHHV7UjIGTKGIliVIiKLPKRWGKN3BFBGGK£TVHCSTTP 
VABRFFTSSTS LTLKHAAWYPSRI LDPHWLLTSWfVXRXYSLR 
BPQTPTNVXIXjSJSAHBESLVX^KGRAYTASLGBTAVTVFDFGPLft 
AVPmiPGQKGKDBVVAYPLYILYENGETFLTYISLLHSPGN/I 
WKAVGSIAHAS\AAKDKYGYDACAVLCLPCVPNILVXRTKSGML 
YHCVVLEGBKKDDHTSBKSHDSRIDLrPSLYVPECVELELALKL 
ASGEDDPPDSDPSCPVKLHRDP KCPS RYHCTHEftGVHSVGLTWT 
HKI>HKFr^DHKDKI)SLQBr»STBQKCFVEHlLCTKPl,PCRQPAP 
IRGPWrVPDILGPmtdTSTYBCLIWPtJiSTVHPASPPIiLCIR 
EDVEVAKSPLRVLABTPDSFBKHXR5 IUiRSVANPAFLKASEKD 
IAPPPESCLQLLSRATQVFREQYIIJCQDLAKEEIQRRVKIiLCDQ 
KKKQLKDLSYCRBBRKSLRBMAKRIAOKYEKAKBKQBDTMNRMK
KDYQQQKMKKVX^LPKPTIILSAYQRXCIQSILKKKGEHIRBMV

KQINDIRNHVOT 



PS lATFTDBPBVLTBPPSATTTTTIGISAXWrTLftGSHGKRNNT 

rrrrss krkmnkitpknvqiifddplpisysqpbkvngbs ks 






KKQPSVLVTPPKBBRXSVSGKASIKLSBTISBGTSNSLSTCTKS 
GPSPLSSPKGKLTVASPKRGQKRHEGWKBVVSRSKKVSVPSTVX 
SRVIGRGGO^INAIRBPTGAHIDTDHQKDKTGDRIITIRGGTBS 
TRQAT0LINAI*IKDPDKBIDKI*XPKHRUCSSSA1SSICIGSSAPTT 
TAAHTSI^fGIKMTTVZ^LSSTSQTAIAilVPAISSASTUKTlXNP 
VH\|iVRPGFPVSFP\t»RYPPPQFRHRU»RRQrPCX)lRPPRI»PMT 
HFGCTPPPAQSTWGPPPVRPI>SPARATRSPKFHMVPRHSHt^SS 
GSOVNSAGSLTSSPnTTSSSASTVPGTSTBGSPSSPSVRRQLP 
VTWKTSHAITTTVTTEASHHHTAPTlOlTYFMPTAKEHYPySS? 
SSPSPPAQPGGVSRHSPLDCGSASPNKVASSSBQBAGSPPWBT 
THTOPPHSSSSSGSSSAHSHQQQPPGSVBQBPHPPI^MSQfVPPP 
BVHHIVFPIjATSSAPVAVPSTAPVTYPMPQTPMGCPQPTPKMET 
PAIRP PPHQTTAPHKN9ASVQNSSVAVI*SVRHIKRPHSVPSSyQ 
LPSTLSTQ3AOQMSVHPANKP IAPNFSAPLPPGPPSTLFKNS PT 
SAHJVFWGGSWSSQSTPKSmiSGKSSTLraSITPmOSDTSKAPG 
FRPPLQRPAPSPSOI VHMDS PYGSVXPSSTHU3NPASNISGGQM 
TGPGAPLGGAPAAAHPNSQHFSPLSLLTPCSSASNDSSAQSVSS 
GVRAPS PAPSSVPLGS BKPSNVSQiDRKVPVPIGTBRSARIRQTG 

TSAPSVIGSNLSTSVGHSGIWSFBGIGGKQDKVDWCNPQ^GNPM 
mRPMSDPGVFSQHQAMKRDSTGrVTPSGTPKQHVPAGYMDPPK 
VGGMPP8VY(JHAMIPPV7VPIPBGAGGPIFNGPHftRDPSKNS r ..IK 
MVSSSTENNGPQTVWTGPWAPHMHSVHMNQI/3 



GVTHLFLPGKRKLRKGIAEDIiRGQADPFPTiLVSKAWATGSPRA 
SLTCLILPLPGIIP^VTiPKAHSRPLUITFTPATDPSDLWKDGQQ 
QPQPRKPBSTLDOAAARAFIHALIGDSSSAPDSQRGQTBPARBR 
PUCKRK XMKAPRARAVABGASGRKSQGRBliHABDKMTHRI LRAA 
QTzenT.p gt.ppr.T jBP HBAGGftGGHXMRRDAPBWrPLMCRftRftGQO 
iUlVBrn^I/SRG/WftWVGVCBljSGRnAACLABBAGPPBVABttlVRBSH 
GBTRSPBinWPTPSIjQYCBHCDTHFQDSNHRTSTRhXLSLSQGP 
QPPNLPXGVPISSPGFKLLLRGGWKPGMGLGPKGEGRANPIPTV 
LKRDQKGLOYRSAPQPRVTHFPAMDTRftV»GRB\TPPaV7m*SJ< 
RKKRRRBB \ KBRAWBRDLRTYJWLEF 



1LGSYSSIQP 

SKGS PQKVHS rDFV\ BL\BHLQP0VLVHAVLRWDF/T I LTEAV 
YSYRGQKQKKVMLTVEQAQDQHVALVIJK3PG7kAW\YPQLQRKKG 
YIflBPKyi*PVQCirYTLEIILBLHTTPlfS5CBCLPDDDIRAXTPKA 
KFQKSAPSFVKISOLftTHLBDKCSGWIiXKAQISEDtAFP ITASQ 
KIAWC^HSSIiKSIPSSLPNIVYTGCAKDGIiBliSTDENRlYKOCP 
SCIiPPTMKXIYYRPALMTAIDGRH0VCXRVESKIJCBKILI^NISA 
DCLKRVIVPSSSITYGMVVWXLPHSIjIJIVSABPCVLKIQSLPVL 
DE^YPLQQDPSLLDFYPDIVKHGAHARL 



RG I PDRRRRGP I GRVNfmLEHKVKKMaiiGHKQG FGAPCLKCKKK 
CBGPBLHPWRKIC3arc\»VAKKSM/TVI*LSHB8DRKVGKI*P3DT 
KYTTiXAKLKSDGI PfiYXRNVMILTNPVW^KKHVSItrrVTYKWA 
PPVQNQMARQYMQMLPKEKQPVRGSHGAQYRKKQLAKQLPAHD 
0DPSKCHBLSPRFVKKM8QPVKKYESKM/5VGDVKLPCEWDAQG 
PKQMHXPGGDRSTPAAVGAMBDKSAKHKRTQXSCYCX3CLSMKBG 
DPAIYTVERAGYPKLWHPACFVCSTCHBLLVDmY FVnCNBK LYOG 
RHYCBSBKPftC3\GCDKI*IP3NBYTQABHCffnfflLKHFCCFDCDSI 
U^EIYVMVNDKPVCKPCYVKmmWCQGCHHAXDPEVQRVTYN 
NFSWHASTBCPLCSCCSKCLIGOKFMPVBGWVPCSVBCKKRMS 



XRKRYPVSGGSVliRSALBVCMDPLSGiTBGSLLPEGPFSGPlDQ 

QOnfQMRRRTOCHRGSAARHPSSPCSVKHSPTRKTXjTYJtOAQRM 

VBIEIEGRLHRISIPDPLBIILBPPLTAC^BMSBCKSNKEN SBRP 

PVaJOTRHKHHRVKKKIfBALPSAHGrPASnSALPBPKVRIVEY 
SPP6APHRPIWfKFIBKSABKlJ>NBVBYDMDKBDYAHLBIVHB
KRXGDCVPAVSQSHFEFI^RFKKESHCENQKQGBQQSLIDBDA 
VCCIC3<DGKX3>ISNVXLPCIH4CSLAVHQBCyGVPYI PEGQWI*C/ 
HAHCIX3SRARPADCVL(3 , NKGGAFKKXDDDRW3iV\ VCALH\ I P 
b\vopantvfibpidgvenipparwklt\chlckkkcis/vcaci 

QCHKA^CYTAFHVTCAaKnQLYNKMRPVKELTGGGTTPSVRKTA 
YCPVH TPPGCT RR PlHXTCPVBMKWGVCRKBSSVKTVRfiTSKVR 
KKAXBAKKALABPCAVLPTVCAPYXPPQRIiJRIAHQVAIQRKKQ 
FVEXtAHSyPnJiKBLSRUOAPLLRRLQSSljQSQRBSQQREHDBEH 
KAAZEKLKYWORLaHDliKRRHIiIiIBLLRRRBKLKBEQVTtVBQVA 
KBIaRLTPLTVI JLR3 VLDQUJDKD P ARXFAQ PVS LKKVPDYLDU I 
KHPMDPATKRE3U£RQGTKNU5KFKKDFDId: IDNCMKTTJARDTV 
FYKAAVRUUJQGGVVLRQARREVDS IGLBKASGHHLPER PAAAP 
RRPFSWBDVPKLLDPAMRAHUGI»EBQLRKl*I»DMLDIiTCAWKSSG 
SfiSERAKLLKKBIALIiRNKLSQQHSQPLPTCPGIiBGPBEDGAAL 
(JPBAGEBVLPHLBTLLQPBKRSRSTOGDSBVBBRSPGKRLDAGL 
TKGFGGARSBQBPGGGZ/3RKATPRHRCASB3S ISSSNS PLCDSS 
PHAPKOGSGKPAIVRRBTLEDRSBLISCXBNQIYAKAABZAAEV 
OQSSMWISTDAAASVLBPLKVVHAKCSGYPSYPALI IDPKMPRV 
PC aiHfigV TIPAPPIiPVlJCIGBBM Q T KS PBKLPliVLPFIBIKRSaO 
HLPKSKMVPLOIDBTIDKIJQIKEGHNSSIRKAVRIAPDRAMNHIi 



YRKRYPVSGGSVLRSaLBVCWDFLSG1»T8GS3jI>PEGPFSGPIDQ 
GNHYQMRRB3IICHRGSAAHHPS3PCSVKHSPTRBTLTYAQAQ3M 
VRIBIEGRLHRISIFDPLBI ILBDDLTAQa^SECNSHXBMSKRP 
PVCLRTKRHKMNHVKKKKEAIiPSAHQTPASASALPBPJWR I V3Y 
8PPaAPimPPVYYKPlHK5AEBIJ3NKVKYDMDKKIJY^5?I*BlVNB 
KRMQDCVPAVSQ3M?BPI^RPEKBSHCBNQW}EQQSI»IDKDA 
VCCI CMDOKCONSHVI LFCDMCNLAVHQBCYGVP Y I PBGQ WLC/ 
RAHCLQSRARPADCn/lX!PNKGGAFKKTDDDRWGHV\VCALW\ I P 
E\VGFANTVFIEPIDGVRjgiPPARWKLT\CNLCKBKGR/VaACr 
QCHKfiHCYTAPHVTGAQKAGLYMKMBPVKBLT CGG T TF SVRKTA 
YCCVHTPPOCTRRPLHIYQDVEHKNGVCRKBSSVKTVRSTSKVR 
KKAKKAKKALABPGAVI»PTV£APYIPPQBIiNRIAHQVAXQRKKQ 
FVHRAHSTYNLLECRliSRKGAPLlARLOSSLQSQHSSQQRBSDHEM 

MBUOiTPLlVLdjRSVLDQIiODKDPARI PAQPVSLKBVPDYLDHX 
KHPMDFATKRKHIAAQGYKNLHKPKSDPDLriXNCMYNARIKrV 
FTRAAVRLRDQGGVVLHOARRBVDS IGLKKASGWHLPKRPAAAP 
BfiPFSWB0yDS£JjDPAN<lAHLGI«BBQLRBLU3MIiDl/rCAHRS3O 
SRSKRAXLLKKECXALLKNKIjSQQHSQPLPTGPGELBO FKEDGAAlt 
GPBAGHBVLPRItBTItLQPRKRSEtSTOGDSBVBSBSPGKRLDAGIi 
TNT3FGGAR3EQKPGGGLGRKA1PRRRCASKSSXSSSNSPI/IDSS 
FHAP KCGROKPALVRRHTLKDRSBLX SCI KNGNYAKAARIAAKV 
GQSSMWlSTDAAASVLBPrJCWWAKCSGYPSYPALIlDPXMPRV 
PGHHSCVTIPAPPLDVLKIGEEMOTKSDKKLPLVLPFDHKRSWQ 

WLPKSKMVPLGIDSTIDKLKMMEGRNSSIRKAVRIAPDRAMNHL

SRVHGKPT3DLSDXD 



BSRSLTMSTSQPGACPOOGAASRPAi LYAU^SSSLKAVPRPRSR 
CLCRQHRPVQIjCAPHRTCRBALDVLAKTVAFLftNLPSFHQItPPQ 
DQHRIJjQGCHGPI*FLIjG1AQDAVTPKVABAPVPSILKKIIiIiBKP 
SSSGGSGQIiPDRPQPSIAAVQWLQCCLBSFWSLHLSPKK \ YACL 
KGPILFNPDVPGLQAASHIGinXKJEAHWVLCEVLRPKCPAAQGR 
LTSVLLTASTLSSXPTSLl£DI*PPRPIIG0VDlAGLLGDHltLLR 



WNVKPSIiiVVOlxFKFSDKKKHSQNDS ISOKTGKTGVHKNIATRK 
VEQPS KBTVKLSHKDDHIIiEDAGSSDISSEAACT NPHCTKBSL V 
GLPSCP/DBVTBQ^KLKDTMGIADKTENTLBRBnCIBPT-GYCRDA 
ESHRQLfiSXBPHlCSKLBVVDTSTPGPKSNILEHAICDWPDQMSR 
QimiBSTB^KSHBTANIjODDRNSOSSSVSYLKSKSVKSKHTKP 
\^HSBQNMTTDAPBCKIVAAXXBVXHSXTKVNVKSVKRNTDVPBe 
QQ^FHRPViCVRKKQIDKBPKIQSCNSGVKSViafQAHSVLKKTLQ 
DQTliVOI PKPLTHSLSDKSHAHPGCLKBPHHPA.QTGHVSHSSQK

QCHKPOQQAPAMKTOSHVKHBLKHPGVEHFKEBDKLKLKKPEKN

LQPRQRRSSKSPSLDBPPLPIPDN1AT1HRS33DHSS3FESKYM 
WTPSKQCCPC 3 aCpHGHHFTnVGCGRCI^fFHGIX^GI»SbS 0 AQ 0 M 
QHKDKEYVCVKCCaBKDKKTBIIiDPDTLKNQATVKFHSGDiUMB 
CBKUSUSKHTTHDH'reYTDDTVKHKVKIIiKHBSGEORMSSDCHD 
NBIKKWQIAPX»8KP«?Q?VLPRRSSKKKSKICIPKEST7VTCTGEK 
ASKPGTHBRQBMKKKXV\EKBVl*n/HPAASASKPSADQIRQSVR 
BSIiKDXZMKRLlDSHIkXVPBBKAAKVATKlBKBLFSFFHinTAE 
YKNKyRSLMFNLRDPKHHlLPKKVUOSBVTPDHLIBMSPEBlAS 
K5LAAWRRRBNRHTIEMIBKBQREVKRRPITKXTHKGBXSXESD 
APMKBQBAAHRlQHPAWJKSLBKPBGSBK\R3EEVDSMSKrrrrS 
QHRQHLFDLNCKICIGKMaPPVDDIiSPKKVKVWGVARKHSXJNB 
ABS 1ADALSSTSHU.ASEPPBEBKQBS PKST7SPAPHPBHPGTV 
BVKSTFLARLNPlWffiPINHPSVaKPVTKAYPVSGSPBYLTHDL 
PDSIQVGC5RISPC7IVWDYVBKIKASGTKBICVVRFTPVTKEDQI 
SYTUjPAYPSSRKRYGVAANNHKQVKEMYLI plgatdkiphflv 
P PDG PGLBLHRKCJiIiGLI IRQKltKRQHSACASTSHIABTPRSA 
PP IALPPDKK3KIBVSTBEAPBEBNDFFN3 PTTVLHKQRNKPQQ 
HIiQBDIiPTAVBPIiHEVTKOBPPKPUlf’ZrPGVLIGWENOPTniBL 
ANKPIiPVDDIbQS[iLGTTGQVYX)Q\AOSVHBQilTVECBIPPLKEQ 
TWSKX BK .TD HV K V TPGS HK B1KVEVDNISB3TDKSAH1BTSWG 
SSS ISAGSLTSLSLRGKPPOVSTBAFL1MLS IQSEQBETVESKB 
KTLXRQLQKIX}EHNIiQr»JQT3NSS PCKSNVGRGNIDGNVS CEKN 
LVABTARSPQPINLKRDPRQAAGRSQPVTTSESKDGDSCRNnBK 

HMI*POISHNKBHLTEQIHVKEKLCSAKKKSCVQQSEWLKVAQNS

PSVKKIC7rSQAB0AKPLQBDHjMQirEBTVHPPRRGSAVATSHFK 

VGNTCPSEFPSKSITPTSILSTSPRTSINPSPMRPQQPUIiQHUCS 

SPPaPPFPGPPHBTPQSMFOPPPHLPPPLLPPPGFG\PA\QNPM 

VPWPPW\HLP\GQPQRMMGPLSQASRYIGPCNFYQVKDZRRPB 

RRKS^PWGR01X)QQ1J>HPPNRGRBDHQRFYSDSHHLKRBRHEKB 

WBQgSR^HHRRDRSQDKDHDRKSRBKGHKDKBRARIiSEGDRGXP 

GKASRDSRNVDKXPDKPKSBDYKKDKKREKSKHREGKKDRDRYH 

KDRDHTDRTKSKH 



PPARR3ARDI*PRAL3MKAARPSGSWIJGAIX3tLL\ljVTLVAFIiI P 
ASDACKNVTLHVPSKXJlAEKLVGRVHIiKBCFEAAlfLIHSSDPDP 
0II*KIX5SVYTrSTILI*SSKKRSFTIIiI*SNTKNQEKKKI PVFLBH 
QTKVIjXKRHTKBXVLSRAICRRWAP I PCSMLKKSLGPFPLFLQQV 
QSDTAQNrriYYSIRGPGVIX2HPHNI»FyVKRim3n,YCTttpVDR 
BQ YES FBI IAFATTPDGYTPELPLPL I IKIKDENDNYPIPTEBT 
YTFTI PBNCRY O TT V GQVCftTDKDBPDTMHTRLKTOI IGQVPP3 
PTLPSMHPTTOVlTTTSSQIiDRBLlDKYQLKlKVfeWflXKJYFGI* 
QTTSTCI INXDDViroHLPTFTRTSYVTSVKKirrVDVBIijRVTVE 
DKDLVHTANWRANmiiKQ9QE37FKlVTDh]C , nfB6VliCWKPI, 
NYKBKQQMrLOIGVVNKAPFSREASPRSAMSTATVTVNVBDQDK 
GPBCNPPiaTVHMKEKABVGTTSKBYKAYDPBTRSSSGIRYICKI* 
TDE’TGHVTlDSHPGSIKVFRSIjDfiEABTIKBErYHITVLASDQG 
OmrCTCTLGIHj(^TVHnHSPPIPRETVlICZCPTHSSABIVAVDP 
DEPIHGPPFDPSLBSSTSKVQRMWRI < KAINmyVARI3YCN'DPPP 
GSYWPrTVEDRliOMSSVTSLDVniCDCITBJJDCTHRVDPRIGG 
GGVCLGKWAILAIIiLGIALPFCILFTIjVCGASGTSKQPICVI pdd 
X*AQQNLrVSNraAPGDDKVYSASGPTTQTVGASAQGVOGTVGSG 
I KNGGQBT I EMVKGGHQTS BS CRGAflHHHTLDS CBGGI-tTHVDKC 
RYTYSEWBS PTQPBL6EBS IRGHTLIHN 



P [XCNPDPGHYKWVKQBSBI SKBSQBMDARPKU5U3PKBGQTI K 
LdGMITNKKGGASKFRTARGGOLSlXP PPPGGKVTI PPP3S / V 
KLPSTNHVTPPS I PKSHHGGSDADXLLDLDS PAPVTT PAPTP VS 
VSmJLWGDFSTRSSSVPNOAPQPSNWVOF 



PPPPPPKLANMADtEAVIiADVSYLMAMBlCSKATPAAaASKR 
PBPSIRSVMQKYLABRNBITFDJCIFNQKXGFIjLFKDFCLNBINB 
AVPQVKFYB3IKBYEKLDHEEDRIjC3tSRQIYDAYXMXBLLSC$H 











PFSKC^VBHVQSHLSKKQVTSTbFQPYIBEICKSLRGDIFQKFM 
B5D)LPTRPCQWKIiVSlJirHLTMNEFSVHHI IGRGGPGEVYGCRK 
/U)TGHMYAMKCLNKKHIKMKQGETLALNKRIMI>SLVSTGDCPPI 
VCTfryAFKTPDKLCFILDLMNGGDLHYHLSQHGVFSHKKPTOPYA 
TSI IijGLKHMHHHPVVYRDLKPAHTIiti)BB3HARIS\DLQLACD 
FSKK3^HASVGTHGYNAPSVI<}KGTA2frX5SADf7F3LGCMLFKLI* 
RGHSPntttHKTKDKHBlDRATTLTVHVKLPIJTFSPELKSLLEGLI, 








RCKVNAADAPDIGBFDBBI>TKGnCLLDCDQEL7KHPPLVXSBRW 
OQHVTHTVYEAVNADTDKI EAKKJU^KNKQLGHEKDYAbGCDCIM 
BGYtfLKXJGKPPLT0MQRR7FyLFPl!nU£HRGBGBSnQHI«I«l>fBQ 
XI*S VHBTQ 1 ECDKKCrLFRIKEGKQPVLQCBSDPBFVQWKKKIjNB 
TPKKAQMiERAPKFI^KPRSGTVBLPKPSLCBHHSNGL 




444 


KRBHBPRIAVCPUlYPSAyBSSPCn^LREOGLCRSGQKFADCRH 
PANRQDVr^GtflHLPVLQLTKDPLKfJt^iRlJMGTRTAFTHHRfiO 
VHKRCINXHRDVGLPGVLKBIANSEBBVFBIIVKTAS6MALALCR 
H7VSSLHGSLFPHLSLRSKDIiIABFAQVTNWSSCCr^VFAUHPHT 
HKPAVAUjDQSVRVYNRSSTIVPSIiKHRLQHNVASLAStKPLSAS 
VIAVACQSCII^WTIJDPTSLCTRPSSGCnQVLSHPGHTPVTSLA 
WAPSGGRIJjSAS PVDAATRVHDV3TBTCVPLPWFRGGGVTOI»LW 
SPDGSKIJjATTPSAVFRVMKAQMWTCKaWPTLSGROQTGCHS PD 
G3RUJTVLGBPIiIYSbSPPEROGKGKE\ALBVQSQQRWTQICIi 
RQQYRHQMVRRGbGSRIjTPWSGTPVQaVWLCL 




1705 



139 



GVGVRGARAMATVQBKAAAXda«SALHSPAH&PPGPSVAQKPPGA 
TYVWSSI INTDQTQVKVKKRRHRLKRHNDCFVOSEA.VDVI PSHL 
IQ9SKyFGDVDIPRAICWRVCQMJ40YKVFEAVPTKVFGKDKKPT 
PKnSSCSLYRPTTIPNQDSQUJKEHKLYSPARYADkLyKSSDIR 
SASbEDbWEmsbKPAKSPaVKISATLSPQVIWBVWQBETIGRb 
LQLVDiPLLDSLIiKQQBAVPKlPQPKROSTWVKSSHYIiDRGIX^K 
AYSDSQBOBWbSAAIDCSBYLPDQMWKlSRSPPEOPDRTDLVK 
HliFDAIGRYYS SRKPLLNHLS DVHNGIAKLLVNGKYEIAIxKAT 
OIJjI*KUiDPQNHEBFRttlJ>YPMAVAflHPSBFKLQKKKTOMVVK 
RIPSKAXVDWKNLSXC3C1D1 jLVIjFL\MDHQH 33VFKIPGTL\HKI 
VS\VK\LMM®*BW)PtnujAGYXY0QRIDQRDy3HHrEKTTKI)B 
LLHLLETl>BBX)SKLGAKBKKK\LIiGQFyKCHPDI FI BBFGD 




1479 


451 


ELQVA\^MimiDHVVXPKXKRAKRPLBKRBPKI2IKHl KNAMLI K 
GGNftNATVTKVLKDVYMiKKPYGVra 

SKKSDCSLPMFGSHHKKRPMILVIGRMYDYHVIjDMIEI/JIBHFV 
SLKDI KHSICCPBGTICPMLIFAGPPFDVTBiyYRRI»K3hbTrFFRG 

ptvs^biaglkyvlhftalijgxiyfrsyklllkksgcrxprib 

LKBIiGPSLDLVXJftBTHIASDDLYKLSHKMPKAIiKPKKKKlIISHD 

TFGTTYORIHMQKQDLSKLOTRKMXKDIiKKRPAKRIX'HDHBKKS 

IGIIKKKIjMBLSQPLLPHCVIiXjXRXIKHQSXQSFIi 


1 3959 



3130 


AAAbGMIJ^HFPACQAFNIiDVKKLTVYSGPXBSYPGYAVDPHlPD 

arxasvlvurpkantsqpdi veggavyy cpwpaegsaqcrqi p p 
dttkhrkirvkgtkkpibpkshqwpg\atvka\hkgksggfvap 

LLPTWBNPLKPTPBKGPVGTCYVJVIQNPSAYABFSPOGNSNADP 
EGQGYCOAGFSIJ)FYKNGDLIVGGPGEFYWQGQVITACVADIIA 
NYS PKDI LRXLAGBKQTBVAPAS YDDS YI/3Y3VAAGBFTGDSQQ 
BbVRGIPHGAQPPGTVS riNSYDMTPXQHFTGBQMASYFGYTVV 
VSDVNSIXJLDDVLVGAPLPMKRKFKSNPRBVGQIYLYIiQVSSLJ, 
FRDPQ ILTGTKTFGRFGSAMAHLGDU^DGYNDIAIGVPFAGKD 
QRGKVLIYKtSnfflGLHTKPFPKPCQGWKASnAVPSGPGPTLRGD 
SPIDKNDYPDLIVGAPGTGKVAVYRflRPVVTVDflQIJiLHPMI IN 
I.ENKTCXJVPDSMKAACPSLHVCAS VXGQ8 IAOT IVLMJVJEJVQU) 
S LKQHGAI KATIiFIJJNHQAHRVFPLVXKROKSHQCQDFrVYtiRr) 
ETBFRDKLSPINISLRYSLDBSTFKEGIiEYKPII^YYRENXVSE 
QAH7LVDCGEDHLCYPDZ*KLSARPDKHOVI IGDKNRLMLJ LNAH 
NEGHGAYKAKLFVMIPKSADYVGIERtWKGPRPLSCBTKMKNVT 
RMWCDXiGHPMVSGTNYSLGLRFAVPRLBETNHSINFDIXlIRSS 
HtCDUPDSNPVStOIHlTAVAQVBlRGVSHPPQIVLPJHNWKPKB 
Amino acid segment containing signal peptide
(A=Alanine, C=Cysteine, D=Aspartic Acid, E=Glutamic Acid,
F=Phenylalanine, G=Glycine, H=Histidine, I=Isoleucine,
K=Lysine, L=Leucine, M=Methionine, N=Asparagine,
P=Proline, Q=Glutamine, R=Arginine, S=Serine,
T=Threonine, V=Valine, W=Tryptophan, Y=Tyrosine,
X=Unknown, *=Stop Codon, /=possible nucleotide deletion,
\=possible nucleotide insertion)
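The column legend can be read as a small lookup table. The following is a minimal Python sketch, not part of the original listing, of how a segment written in this notation might be tallied. The function name `summarize_segment` and the example string are hypothetical; only the symbol-to-name mapping comes from the legend.

```python
from collections import Counter

# One-letter symbols exactly as defined in the column legend.
LEGEND = {
    "A": "Alanine", "C": "Cysteine", "D": "Aspartic Acid", "E": "Glutamic Acid",
    "F": "Phenylalanine", "G": "Glycine", "H": "Histidine", "I": "Isoleucine",
    "K": "Lysine", "L": "Leucine", "M": "Methionine", "N": "Asparagine",
    "P": "Proline", "Q": "Glutamine", "R": "Arginine", "S": "Serine",
    "T": "Threonine", "V": "Valine", "W": "Tryptophan", "Y": "Tyrosine",
    "X": "Unknown", "*": "Stop Codon",
    "/": "possible nucleotide deletion",
    "\\": "possible nucleotide insertion",
}


def summarize_segment(segment):
    """Count legend symbols in a segment; collect characters outside the legend.

    Hypothetical helper: whitespace is skipped, every other character is
    either counted under its legend name or returned as unrecognized.
    """
    counts = Counter()
    unrecognized = []
    for ch in segment:
        if ch.isspace():
            continue
        if ch in LEGEND:
            counts[LEGEND[ch]] += 1
        else:
            unrecognized.append(ch)
    return counts, unrecognized


# Made-up example string (not a sequence from the table).
example = "MKV\\LX*"
counts, unrecognized = summarize_segment(example)
```

Any character that falls outside the legend (digits, lowercase letters, or letters such as B, J, O, U, Z) is returned in the second value, which may help flag rows where the transcription of a segment is likely damaged.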



BPH3CBEBVGPLVBHIYKLHNTGPSTISDTXhKVGWPPSARDBPL 
LYIFHIQTlJGPLQCQPHPNIKPQDIKPAASPSDTPELSAPliRWS 
TI PHLVRKRDVHWKFKRQSPAKL IiNCTNI ECLQ I S CAVGRLEG 
GBSAVLKVESRLKAHTPIiQRKNDPYALASLVSFEVKKMPYTDQP 
AKLPBGS IAI KTSVIWArPHVS FS IPLWVI ILATU GLLVLAXL 
TLALffKCCFPDaAapPQEDMTIlEBQX*TNI)KTPEA 



GTSGXAAQQLPSIjLKKKEPHOQTIiIiIKVPASGHIiHHHQVVGCTTKC 
NTliPVVIJVQTSQITKIPn/KDREPQGVTQOGCGXHAIBliJPSRT 
LLATGSDfPMSIATraijPTUJPVCVGDDGflKIWIPSIAWISDTM 
AVSGSRDGSMGWEVTODVLTKSDARHNVSHVPVYAHITHKALK 
DZ PKKDraPDNCKVRAIjAPNHKinCEIjGAVSIJjGYPHLSinCABrnT. 
SBCLLSTKLPYCRBHVCLAYGSBWSVYAVGSQAHVSPLPPRQPSY 
NVK£VC8RHRGSGIIl£V3PYElIirTVGTOQGSULFYDIRAQRFL 
BKHLSACYOSKPRLAGEHLKWlt5\RGHLHHDBTWRNYFSDXDP 
PPHAVYTHCYDSSGTKIjFVAGGPLPSGIJlGNlfAGIjSiS 



FVWSDG3PRPRKGPAVGAGAAHLSDPWAMTPGTARRATNPLKKB 
LDWASINGFCKQIJJSDPBGPPLATRLLAHK1QSPQBWBAI0ALT 
VLBTCMKSOOKB PHUBV ' GR PRPUBLIKVVSPKYLGSRTSKECVK 
NKILKLLYSWTVGLPBEVKIABAYQMLKKQG\ rVKSDPKLPDDT 
TPPLPPPPPKNVX PBDSBKSKMIiARIiLSSSHPBDLSAAHKLZKB 
KVQSDQKHMBOSKRVMAIHBVHHNVKLLTEMVKSSSQGGAAAO 
SSBDL\MKBL\YQRCTRMIUrriiFPT(aVDTBI)ND\BAIAEILOA 
NimTOVIHLYKQLVRGKKVN3imXftGSIPGSTSALLDX*SGLDL 
PPAGTTYPAMPTRPGBQASPBQPSASVSIJaJDELKSIXaiSDPTP 
PSOPSUXTrGWSPQSSDATBPPAPALAOAPSHBCTPPAQTSLP 
ASSGIiDDLDIiIiGKTUXJQSLPPBSQQVKWBKQQPTPRLTLRDIX) 
NSCSSSCSSPSSSATSUjHTVSPBPF^PPQQPVPTELSIASITVP 
LBSlKPSNILPVTVXDQHGPRrLPHPAFDPLPGASDVLWWSM 
LSTAPQPIRNlVPQSAVPKVMKVKLQPPSGTKLPAFNPrVHPSA 
| ITQVIdiiANPQKBKVRI»RYKLTPTMGDQTYNE>rcDVDQPPPPBT 
WGSL 



GLKVAI KmDKKAMYKAGMVQRVQNKVIcrHCQLKHPSILELYKY 
FBDSNY7YLVLKMCHNGKMHRYLKHHVKPFS EtfKARHFKHQ I IT 
GKLYUlSlSILBRDLTl^Nl^TEQDQaKXADPGLATQLEnPHB 
KHYTLCGTPNYI6P3IATaSAra3LESDVWSLGCMFYTLI*iaRPP 
PPTDT Y KMTI/fgyvi«ADYKHPTPIi3IRRKDLIHQLLRKHPADRI» 
3Ii3SVU)HPFM3BNSSTKSKDLGTVBDSIDSGHhTISTAlTASS 
STSISGSLFDKRHLLIGQPLPHKMTVFPKHK5STDFSSSGDCTS 

fytghgnqbtsiisgrhwiqdabbrphsrxlhrayssohsotss 
SQSQAKTYTMKRCBSABMI^SICRSGGGKNBBRYSPT ataHMJIP 
NPFKEKl’SSSSGSFHRPDNNQALSNHLCPGKTPPPPADPTPQTB 
TVQQWFCan^IKAmj?KTTEYI>SISPNRDPQGHPDLQrcDT 3 KHA 
Hf I7r KV ilKHSDASia?AH3VKQQNl!HK?MTALHSKPBI IQQBCVP 
GSDPLSEQSKTRGM3PPWGYQNRTLHS ITSPLVAHRLKPIHQKT 
KKAW3 ZLDSBEVCVKLVKEYASQBYVKBVLQI SSDGNTITXYY 
PSC3G\RGPPLA\DRPPSPT\DNISR\YSP\tMt»PBKYHRKYQYA 

SRPVQLVRSKS pkityftryakcilmenspgadfkvwfyegvki 

HKTBDPIGVlIHKTGXSYnJCSBSBVHSLKBBIKKYMDHWIBGHR 

iclalbsizsbrbrktrsappppiiigrkpgstsspkalsppps 
VDS^PTRD ftASFHRMVMHSAASPTQAP ILNPSMVTH3QLGLTT 
TASGTDISSNSLKDCIiPKSAQl»l*K3VPVKNVOIATQ\UPSGAVK 
VQPNDGSQLWQAOVSS IS YTS PNGQ\ TTR\YGBNBKLFDYI KQ 
KLQCLSSILU1PSNPTPNPH - 



RVCSS3ASTASQAVMADAWHEIRRLAADFQRAQFAKATQRLSBR 

NCHBIVNK3*lAQKQLEVVHTLDGKBYITPAQISKEMRDBIfiVHG 

GRVUIVDLQQVIUVBLIHIBHRXGDIIKSBKKVOLVIXSQIjIDEH 

YLDHLABBVNDKLQESGCfVTI SBLCKTYDLPfflJPtiTOALTQRIX? 

RIISGHIDLDHRGVIFnSAFVAJUIRARIRGLESAITRPTAVNSI* 
ISKY0I1?HQI«LY5VLSgLVNSGRI^CTVVGGRai>KAV PVPDIY S 
jmJSTWVDSFFRQMGYLKFDALSRLGI PDAVSYIKKRYKTTQLL 
Beginning nucleotide location corresponding to first amino acid residue of amino acid sequence







PKNPQDFPGHRGLHG\QKGBIGPP\GQQC3KKQAPGMP\QI^IGSN 
GSPGQPGTPG3KGSKOBPGIQGWPGASGIJCGHPGATGSPGBPGY 
MCa*POIQGKKGDKGH0GBKGIQGQKGBNt5RQGIPGQQGIQGHH5 
AKGKRGKKGBPGVRGAIGSKGBSGVDGWCPAGPKGQPGDPGPQ 
GPPGLDGKPGRBFSBQPIRQ V L T i) VTIR AQIiPVIJiQSG R TOtfCP H 
CLSQHGSPGIPGPPGP3GPKGPHGLPGLPGHDGVPGLVGVPGRP 
GVRGLKGLPGRKGEKESQGPGYPGEQGPPGPPGPBGrPPGISKBG 
PPGDPGLPGKDGDEK5KPG1QG0PGPPGICDPSLCFSVIARRDPP 
EKGPHY 



SCRTBGPiapLQPRBAGSSBGSBARSBPPRPGQIBSACQVQrrK 
RGDPHKLRHXFIjQYASTEVDGKRYMTPHDFVQHYLGLXHDPHSS 
PKTV^UjAOVAlXnxJXnJ SyQKFIAFKSVLCAPnSME^TVAPQL 
FDKSG BGB V TPB NVKBl PGQTl HHHHXPPWWDCBFXRLHFGHHR 
kkhlkytbftqfi»qklgi*bhahoafaijcdksksghisgi«dfsdi 

MVTrRSHKLTPFVEENLVSAAGGSISHQVSFSYFRAPNSLLKNM 
ELVRKIYSTtAGTRKDABVTKBBFAQSAIRYGOATPLSXDILYQ 
LRI1LYHASGRLTX»ADI ERXAPlAEGAlPYtttAHLQRQQSPGLGR 
P rWLQ IABSAYRFTLGSVAGAVGATAVy? IDLVKTRMQNQRG6G 
SVVGELMjrKMSFDCFKXVLRYBGFFGLYRGIirPQLIGVAPBKAI 
KLTVKDPVRDKFTRRDGSVPJjPAKVIiAGGCAGGSQVI FTHPLEI 
VKIRLOVAGBITTG PRVSAUTVLRDIjGI FGLYKGAKACFLRDIP 
PSArYFPVYAHCKLLLMffiNGHVGGUJljlAAG7MAG\VPAASLV 

tpadvi ktrlqvaaragqttysgvxdcfrkilNbbbgpsafiikq 

TAAKVFRS5PQPG\VTLVTYKLMJHGPYIDFa3IiKPA3SBPTPK 
SRIADLPPANPDH IGGYIUATATFAG IKNKFGLYLPKFKS PSVA 
WQPKAAVAATQ 



MVTVfljYRFLPTSNMAAKI^LlPPDLRIiQFWlJlARLQKCFLSaG 
OGSYCRGAKASPX^PGKMAMOIJiOGRFRT.TiRr J QSGRRVHSVAGP 
SQWXaCPLTTra*IOTAAPCCCRPHYlPIAASGPKSLSTSAISPA 
EV0V0APPVVAATPSPTAVPBVA8GBXADVVQTAAEQSFABLGL 
GSYTPVGUQNLLBFKHVDLG^PIWGAIAACrVFAECLXFPLIV 
TGQREAAHmNHlPBIQKF^SRIRKAKIAGDHJLKyYKASSBMAIi 
YQXKHGI KLYKPLI X.PVTQAPIF I S FFIAlABMANLPVPStOTG 
GLHWFQDUVSDPrYIIiPLAVTATKWAVLBUSftBIGVQSSDI^JW 
MRNVXRMMPL ITIiPITMHFPTA VPMYWI^SKLFSLVQVSCLR 1 P 
AVRTVLKI PQRWHDLDKLpPREGPlES FKKGWKHAEMTRQLRB 
HBttMEKQIjEIiAAI^PIJiQTFTHNPIjLQPGKDNPPNI PSS\fiSS 
SSKPKSKYPWHDTLG ' 



RSKQVMARLTKRRQADTKAIQKLWAAIB IIRNQKQIANI DRITK 
YMSRVHS4HPKKTTRQI^IAVKDGI.IVBTLTVGCKGSKW3IEQB 
GTHLPGDB ItnfETENHD>nrCFBCHlPGKVLI CPDCF RVYH SKCL 
SDBFRIJRDSSSPWQCPVCE6IKKKNTNKQBMGTYLRFXVSHMKB 
RAIDLNKKG HDN KHPHY1 UUjVHSAVT>VPTI QBKVNBGKYRSYHB 
PKADAQIJJ*HNT\aPYGAnSBQRDIAKttYianXlffiL\DBIiC2LC 
KlK!FYlANRRPDi?WFCYPCIPNHELDWAKMB9FCSPWPAKVMQKB 
IMQVDVRFFGHHSQRABI PSantQDITVNIHHlHVKRSMQfKKA 
CDBLBLttaRFLRBGRPWKSKNEDHGKKKZ\BSSISSTSNBQIiKVT 

qkprakkgrhkqsvbpkkebpbpbtbavsssqkiptmpopxbkv 

SVSTQTffKX>SA33PRKLHRSTQTTNDGVC0SKCHDKYTKIFNDF 



XDH>4KSlJHK3<BTKRWRBAl^KIiRSKHKBBK3cQAVti KAVANNQG 
BMDRKCKQVKBKCKEBPVEBlKKIATX?HKQLISQTKiaCQWCrHIC 
EHEAMYHCCWNTSYCSIKOQQEHWHAEHKRTCXRKR 



as BLQVMARLTXRRQADTKAXQHLWAAIBI IRHQKQIAKXDR1TK 
7HS»VHa<HPSETTRQI^IAVB3)GUVBThTVGC3tGSKAGXHQB 
CJYWIJ<3)B2DWBTBNHDWYCFECELPGBVbICDLCFRVYHSKCL 
SDKPRIiKIiSSSPWQCPVCftSlKKJCOTNKQfMGTYlilPlVSRMKB 
RUDUIKKGKpHBHPMYRRbVHSAVDVPTICgKViaBGKYIlSYKB 
FKADAQLLL HH T VXP TPGaPSEQADIAKMLyiCDTCBBL\DgX^yiC 
IO}CFYLfiHAHPtWW P C TP CnyHHElJ>WAaMXGyGgWPAHyKQKH 
ENQVDVRPFGHHKQRASflPSSNIQDITVHIHamVKRSfGWKKA 
CDBLKUKJRFLaBGHPMKS KNKDROKEEAKSS ISSTSHBQLKVT 

o^peakktsihhqsvbpkkebpbpetkavsssqbiptmpqpikkv 
SVSTQTKKLSASSPRMIflRSTQ^nSDGyC^MCHnKyTKIPlHJP 
KDRHESDHKRBT6RVVRRAI*BKltR9EMBEBEHQAVNKAVAKMQG 
BKDaKCKO VK K KCT EBPVBBIKKLATgOCOLI SOTKKKQIi CXHC 

bbeamybochntsycsikoqqkhwhakhkktcerkr 



vrfpreggapptvltpgrqqgvfijgpqrpgskpdipargqphpp 

RPVGVSTSJWQAQVQPPM<HRRRIJdXn>GFCXLACTBIjSVUJVYI> 
SNMLPVByVPYYl*PCPKIPNMICLHyKRKKPLQPVVPfSOyPQPKI> 
I^HRPTOIXTLTTWLAPrVSBGTPHPBbLOHlTQPIiNLTIGVTV 
PAVCT/HFLBSAEEPFKR0YRVHYY1PTDSPAAVPGVPU3PHRI. 
Z^SXPlQORSRHEBTSlGiRMBTlSQHIAKRABSSVDYLPCUDVD 
MVFRHPWGPBTMmLVAAIHPSTStAVPRQQPPYKHRRySlAPVA 
DSBOTFYYGGAVFGGOVARVYKFTRGCHMAILADtANG INAAWR 
KSSHLMRHFISNBPSKVXjSPBYLHDDRXPQPPSLKLIRFSTLDK 
DISdiRS r 



DVG PHI BCRKRCDLD VFIiKSPRKPSGRRDRAPEKQRK IAAN KGLC 
TGVRKGEPPS/TTSOKVKBAGRDFTYLIWljraiSirGGLFYTI 
PXBLFSSSSPSKIYGRAIiBKCRSHPSVIGVFGBSVXOYGBVTRR 
GRJ^HVRPTByVKIXJLKHTCVXFYXSGSBPGKOGTVYAQVKENP 
OSGKYDFRYIFVEXKSYPRRTI I IBDKRSQDD 



SQDifXGRRliLOXHGW KLGQGLGKB LQGETDPXPIVVKYDVrWWG 
RMEHKLD YAKDATBRJlRVLBVBKKDTEKLRQWf KDYVDKBKAIA 
KALSDLRANPYCELCDKQYQEERQBFDNHniSyDHABEQ&UCDIJC 
QRgPAKNVSSRSRKDSXKQEKALRRIiHHI ABQRKQABCAPGSGP 

mpkpitvavdbbggkddkdbsatnsgtgatasoglgs epstdrg 

GPFTAVQ lTOI T BL ftQAPGtiASQGXSPOI KHNUSTPLQKIjGVSF 

GDSIX^SNIJWKKEDEDPQDGGSIASTLS KLKRMKRKEGAGATE 
PEYYHYTPPAHCKVKPOTPPIAFMRASEQMDGIIOTTHPKNAPES 
KKGSSPKPKSCI KAAASQGMSKTVSKVSBQP KBTSMTBPSKPGS 
KAEAKKAI/5GDVSDOSLBSHSQKVSBTQMCKSH3SKBTS1ATPA 
GKSSQSGPKHPropPFPVLSKDBSTALQWPSELUFXKABPSIS 
YSCSPLYTOFKLSRHKIXARTKGTBKPKDlGSSSKDHbQGLDPGE 
PKESKBVGGBKIVRSSGGaMQAPASGSACSGLBSQBPGGSHGSB 
TEDTGRSLPSKKBRSGKSHRHKKKKKHKKSSKHXRHHKADTBEK 
SSKABSGBKSKKRKKHKRKKNKSSAPADSKBGPKPKPPGSGSPA 
ppbbhrraqddsqrrslpakhgssgxkdeggggsssqdhggrkh 
kislppsscqrragticrssrsshhsqpssgdedsddasshrijjq 

KSPSQYSBEEBBBDSGSKHSRSRSRSGRRHS3HR3SRRSYSSSS 
DASSOQS CSrSRQRS YSDDSYSDYSDKSRRHSKHSHDSDDSDYAS 
SPMRSKRHKYSSSDDDYSbSCSQSRSRSRSHTRBRSRSRGESRS 
SSCSRSRSKRHSRSTTAHSWQRSRSTSRDRSRSTHSPSQRSGSR 
KRSyGHESPEKRHSGRRDPIRSKIYBSQSPEnrPRSGRGEGPGKK 
DDGHGDDSKA'PGPpSQHSNIGTGSKJSKGDCSPKDIHfiVTAKLLIi 
KXIQSRKVKRKPSVSKBVQATPHKAGPKLKDPPQGYFGPKLPPS 
l/2JKPVLPI*IGKLPATRKPinOCCSKSGL3RGKBQEQSBTEBGPP 
gssdai*pghqpp\sebttgpi*ldpppbes ksgfvtadhfvaplg 
PPAHPDCYLtTOPTISHimJDPSDGWTLBSUJSSSQPGPVBSSI. 

LPIAPDIiKHFPSYAPPSGDPSIBSTDGAKDA\SI*APLESQPITP 


















TPBEnBKYSBXQQAAQQHIQQQhLAKQVKAFPASAAliAPATPAL 
QPIHIQQPATASATSrTrVQHAILQHHAAAAAAAIGIHPHPHPQ 
PLRQVHHXPQPHLTPISLSHLTHSI I PGHPATPLASHPIHXIPA 
SAIHPGPFTFHPVPH3\ALYPTLIjAPRPAAAAATAIjHLHPLLHP I 
FSGODLQHPPSHGT 



SFuYIWGVDMDNPIGctWDGRPDGVQIXIQFACVBSTIIjIiHIHDII 
PB5VTQESftPPKLAmSRGVt9>£Q9SSSHNXPKATGST5I>PC3SRH 
RSELFYTUJGSSVDSQPQSKSKHXWIDSVAEDPAKSLTBISTD 
PDa3SPPbQPPPVBSLTTHIISFHSLPPSLTKNPNXHGSIGHSPL 
SLSAQSVMKKUmU»VQSSPPIiAMPPGHSHGI^VGSIABVKEHP 
PFlGVXRWlGQPPGlJfBVLAGljBIiIJBCAG\C , nX5TP/RBt5TBY 
FTCALKKAIiFVKLKSCRPnSRFASIjQPVSNQIKRCHSLAIHKAY 
I*SKWEEHTPTQJWKKBGLBIMIG\KKKGIQOTSHS CTtiDSTIiF 
CI^APSSVIdrrVIiUlPKBKNIJVKYYSETQEIjIji?TB IVNPLRIYG 
YVCATKIMKLRKILBKVBAASGFT3BEKDPBBPLNILFHHILKV 
BPLLXlRaAGQKVODCyPYOI FMBKNBKVGVPTlQQIiEHSPlN 
SmxKPASAPSaaiQMPRPGKDFKLFKKlPPSLSLiJITDLLHDT 
FRQCRXOGGIAMYBCRBCYDDPni SAGK1SQPCKTQ3T0VHLHP 
HWOTKYHPVSJjPKDLPDIiniWRHGCXPOaHMELFAVLCIBTGHT 
VAPV KY G KDD SAHLPPDSMRDRDGGQfeCFIflPQVTPCPSVGBYI* 
BHSl*RDUSI»DSgMQGC!!MUflUt^g>AntVPCTQ3PIM3I»YR 



ILIAGS PSPRDQCSGRQSSGGDKKLiVTHGCTPSTAVVS PSAM1Q 
HPFRKBbAYDHMPTLKRGRQBPASYAPCAKPSDLOLSKRLFPCF 
SHHTnVPSVLMGSCLLVTSGFSItTl^SVVFPAEHDfLRCAAGSCl 
PSA1 VSFTVSSRHAHVI PHFQIiPV STFJWTTTCLXN FGCKLVI* 









DSAtf ILDBVPPPARVLKSYSVVBVIAGISAVIGG 1 1ALNVDDSV 
SGPHLSVTFFWH»VACFPSAIASHV?iABCPNKCLVKVLIAISSI» 
TSPLLPTASGYLSFSIHRIVEMPKDV?PAIKPSYPVIiIJ>I»I,TJ.V 
LLLQA/GPQHGHRHFVRALQGQCK7iAGCrLGHPERPAGAPGNS5 
GQBPPEGVRQGBSLE3RfiGAMGPVTPRRGNRVAAPSIiAP<aiETH 
NP 



HCTGHDLPC2UHA»RSNGI ISNPRHSPHAjQ4AEDBPDAKSPiCIG 
GRAP PGGABAGEPTTLLQJtLRCTISXAVQinCVBGILQJJVOKPSD 
NDKLYLTLQXxPSGPTTGCKSSEPSTXSHEBYMYAYRHIRNHLBB 
HTETCLPKQSVYDAYRKYCBSIjACCBPI*STANPGKIIRH 1PPDX 
KAimiiCH3RG0SKYCYBGIRRKTLV3MPPliHJIiDJLXaSESPEKGP 
EVTPAPMJBLVBAACM/TC3)KAHRI LKRS FSS IVKVARFIjLGQH 
1*1SAR3AHAHV1KAWSLABKDEHAPRKRSSKPKHGLQIPEX3GAH 
KKPKRIAQPPKDIiKARTGAGPLARGBRXKSVWESSAPGANNLQV 
iBVLVAHLPIiLLPRAPRSLlPPIPVSPPlLAPRLSSGALKVATLP 
L55RAGAPPAAVP1 XHMILPTVPALPGPGPGPGRAPPGGL7QPR 
gtehrhvgiggdqgphdkdvkrtabvpvskasgoappaktuvkqp 

IKDTASDAKRKRGfiPLKKSGGSGBRN3TPI.H3AAAMBSA0SSRIj 

PNBTKGSGGEG3?SAGGABRPGPMGEABKGAVIiAQG\QGDGTVSK 

ggrgpgsqbtkeabdkiplvpskvsvi kgsrsqkkafplakgbv 

DTAPQGNSDIiKEHVLQSSIiSQKHKDPKATPP 



I£LQpSnTSGRIHQAKVT^BDMKSVTVBWIKNGDTOGK\BXD 
LKSIFSLNP\DL\VPDGEIBPSP\KTPPPPASSAKVHKIVHHBR 
TV\ASIKHDPPS\RIHJRWGSARARPSQFPBQPSaAQQHGSV\S 
DISPVQAAKKEPGPPSRRKSKCVKKVBKLQfiKREKRRLQOQBLR 
BKRAQDVDATHPNYBIHCMIRDFRGSLDYHPIilTADPIDEHRIC 
VCVRKRPLHKKBTQMKDlinn TI PSKDWWVHEPKQ KVDLTBYI, 
KHQTFT^FDYAFDDSAPNEMVY RPTARPL VET 1 FSRGMATCPAYG 
QTCSGRTHTMGGDPSGQfQ?M!SKGlYALAAKDVFLMIiICKPJJYKK 
LELQVYATFFE IYSOKVPDLLNRKTKIjRVLKDGKQQVQVVGIjQB 
REVKCVBDVLJCLIDIGNSCRTSGQTSANAHSSRSHAVFQIIlilR 
RtnGJfl3QPSLIDLAGNBRGADTSSADfiQTaLBGABI2JKSIjIiALK 
BCXSAUaWHPHlTPBASKiyrQVliRDSPIGENSRTCMIAriSPG 
|^CKNTIjrrhRYANRVKKLTVDPTAAGEFVRPIMHHPraQl\DD 
r.tfTQ^gvRRfipQ fffinTtin' j«g f^NBggVSPQiuartiBAVSQMPB>i 








BBQVVBDHRAV FQES IRWIoEDKKALLEtfrEEVUYD VDS Y7VTQLB 
AlI£QHTDILTKLRDKVKSPHAALQBKSQftSKQINPKRPRAL 


5975 


4293 


2200 


^GLC^^nrS<3lIliOAMVTSI2SH0WB5 VTVEHIHNGDTKGK\ K ID 
LBSIFSIBP\DL\VPDGBIBPSP\BTPPPPASSAEVNKIVKNER 
TV\ASIKNDPPS\aDiIRVVGSAEARP8QPPEQPSSAQQNG3V\S 
DISPVQAAKKEFGPPSRRKSNCVKEVBiaiQKKREKERLQQQKLR 
EKRAQDVDAINPNYKIMCMIRD FRGSXtDYRPLTEADP IDBHRI C 
VCVRKRPl4*KKBTQMKDIJJVTr I PSKDVVHVHBPKQXVDLTRXI* 
ENQTFRFDraFDDSAPNBMVYRFTaRFIfVBTIFSRGMATCFAyG 
QTGSGKTHTMGGDFSGKNQIKSKGITMiAARDVPLMIJCXPNYKK 
LELQVYATPFBIYSGKVFniJJTRKTKLfiVliKDGKQQVOVVGLQB 
ESVKCVKDVIiKLIDIGNBCHTSGQTSAHAHSSfiSHAVFQrrLRn 
KGKIJH3KFSLIDIJtGNKRGM)TSi&ADRQTRLBGABINKBIiIiAIiK 
BCXHA3XSRBKPHTPPRASKLTQVUID3PIOEN9R7CMIATIS PG 
HASCKNTIjmjnaNRVKEDTVDPTAftaDVJtPIMHHPPHQlVDD 
LBTQW3VQ3S PQRDDIiKLLCBQHKBEVSPQIiFTPHBAVS QMVEM 
EBQVVEDHRAVFOBS IRWLEDKKMiLKMTEBVDYDVDS YATQLB 
AIIiHQKIDlLTEDRDKVKSPRAALQSKHOASXQXNPKRPRAI* 


5976 


20 


2949 


VHHLHL»TRVSWVNLDIILHlAQQMGIKTLHliVLG\LKRA\LBK 
PBVBWKBVKDP^04KGAMLTNTGXYAIPTIDA\K7VYAIGKKEKPP 
PLPEBPS5 S S BEDDPI PDB3J»CXICKDIMTDAVVIPCCGNSYCD 
BCIRTALliESDBJn'CPTCWQNDVSPDTdjIAilKPIjRQAVNNFKME 
TGTTKRLRKQUPSPPPPIPPPRPLIQRNLQPLMHSPISRQQDPL 
KIPVTSSSTHPAPS ISSLTSKQSStAPPVSCTPSSAPAPVPDIT 

atvbisveseksdgpprdsdnkii*paaaiasehskgtssiaita 

UlEBKGYQVPVI/yTPSLI/5QS£JjHGQIdPTTGPVRINTARPGGG 
RPGWKHSNKLGyLVSPPQOIRRfiBRSCVRSlHEGRHHSERSQRT 
OGPSiPATPVPVPVPPPPLYPPPPHTLPLPPGVPPPQPSPQFPP 
GQP\PPAGYSV?PPGFPPAPAKLSTP>fVSSGVQTAHSNTIPTT2 
APPLSRBB PYREQRRLB^BBBKKKS KLDBFTHD FAKE1MBYKKIQ 








ksrrrspsrskspysgssysrssytysksjisgstrsrsysrsfs 

RSHSRSYSRSPPYPRRORGKSRNYRSRSRSHGYHRSRSRSPPYR 




RYHSRSRSPQAFRGQSPNKRNVPCGBTERBYPtiRYRBVPPPYnM 
KAYyGRSVDFRDPPKKERYRKWKRKYREWYKKYYXGYAAGAQPR 
PSANRKNFS PKPPT.PLNIRNSPFTRGRREDYVGGQSHRSBNIGS 








NYPBKL5AR13CKI3QKZ2NTXSKBKBSSKAFGD0XB)IRRKKHRKRR 
KGRBSEGFLNPKLLETSRKSRKPTOVR5NKTDSXiFVI*PSRDriAT 
PVRDBPMDAB3 ITFKSVSRKDKRBRDKPKftKGDKTKKKNDGSAV 
SKKBHIVKPAKG PQEKVXX3 \DVRDLLDLNIi\QLKKPKHHTPKDIj 
TILNHBI«PLRRMKKSX i\BPP\EKIiTLilZQQK\TPRZncTSQRGlCSB 
BGLFQROQXRKAHH 


5977 


1363 


1335 


PIBDRGQVLSHFQCLSLHSINHIIjHPS&fiVAAGPAl'GW/REYLT 
PVLKKSKFKK TGV I TPBRFVAAJGDHLVHHCPTWQHATGEELKVK 
AVTjPTGXQPLVTKNVPCYKRCKQMHY SDE tKAI IKEDDGDGGHV 
DTYHNTGITGITSAVKE ITliENKDBIBLQDbsALCBBBBDBDEG 
KAADMBEYKRSGI.I.RTDBATLDTRKIVHACKAKTDAGGgDAILQ 
TRTYDI*YlTYDHY?QTPIdWLFGYDHDQHOPi*TVBHMTKDISQDH 
VKXTVTIENHPHLPPPPMCSVHPCRHABVNJQCIIBTVABC3GGKL 
GVHWILLIFIiKFVQAVIPTIBYDYTOHPTH 


5978


160 


3213 


RDGARRWGGOQS P LTWAPG FYRRFDIATSGRRLRGQTAEPAGRQ 
RPRRBPBAMDBQSVBSIABVFRCFICMBKLSDARLCPHCSKLCC 
FSdRRNIjTBQRAQCPHCKAPLQIRBLVNC^tABBVTQQLDTLQ 
I^LTKREBNEIO^KCBNHHBKl^VFCVrrCIQCCICHQCALnGGMH 
G6HTFKPIAKIYBQHVTKVNBBVAKIJtREL«EbISLVQKWRNV 
KAVRKTVKDKRVRBIRNAVKMMIARLDTQLKNKLITIjMDQKTSLT 
QBTBLLBSLLQBVBHQLRSCSKSBLISKSSBIUmFQQVHRKPM 
ASFVTTPVPPDFTSKLVPSYDSATFVIjBNFSTLRQBADPVYSPP 
LQVSGLCWRUCVYPDGWSWBtJYYLSVPIjSLSAGLPBTSKYBYR 
VBHVHQSCNDPTKNIXRBPASDFBWCSBCWOYNSFFIILDIjIAKBG 
YUJPQNDTVILRFQVHSPTPFQKSRDQHimTQI^AAQTSYIOQ 
IHNLKERLTtBLSRTQKSRDJ^PPDNHLSPQNDDM^TRAlCKSA 
PCT/US00/34263



CSnHrJ«BR\GPYSAS\VRBAKBPgBDBE&lQ)SEDYHSgIjSDGDir~ 

DLDLVYRD BVNQLDGSSSSASSTATS NTKEHD IDEE’IWSGHNIJV 
BYKHHBXHEGKI*MHi>AAAAGPAOSSHGYVG^SRISRRTBriCSA 
ATSSLI^IDPLmBMiDLKDR^XENLWJIiQPRPPASLbQPTA 
SYSRXDKDQRKQQPl^VPSDLKMIJCRIJniJMAKVRCMKTnVKH 
TLSE I KSSSAASt 3 WrSLPSADQAALAACGTKH 5 GRX< 3 DU 24 B 
UJUCSSVAHCYTRHSTOOCSKSPKPARSSVAOSLSLRRAVDPGB 
HSRSRGDCCJTXSEGSPGSSQSG 3 RHSSPRALIE 3 SIGDXLPKTB 
ERQC$CRLDSRAVVVAVPSGLPAVEKRRKMVTlX 3 ANAKCK 3 n 3 S 3 L 
QHTDLENNBBTGBLQPVLPEGA 3 AAPRBGMSSDSDIEC 3 DTKNKB 
gBBHTSVGGFHDSFMVMTQPPDBDTHSSFPDGEOIGPBDLSPNT 
DOSHSGR 

LPI)^fMYJL^I,KIXAFGFAFIlXrKVFVTC 5 QSPTPSpTDAYIiNASB — 

TTTI^PSGSAVXSTTTIAXTPSKPTCDBKYAtflTVDYLYtfKBTK 

LBTAKlJWKBNVKOSNNTClKNBVHNLTECKNASVSIimNSCTA 

PDKTlJ;unn?PGVHKVPVHCXX\QVEQ PD 8 TIRLKHKNXBTSTC 

DTQHITYRPQCGHMIFE^XRXKIiKNLBPKHBYKCDSBILYNSHK 

PTHASKIXB^n)PG 3 PGBPOIXPCRSKAAHQGVITWNPPQRSF 3 Df 

PTIiCYX KSTBKDCtJiU)ronjIKYDbQHL KPY T K Y VI »SLHAYl XA 

kvqrkgsaamctfttksappsqvhnmtvsmtsdnsmhvkcrppr 

DRTOPHBRTOIjgVBAGMTLVRHBSHRHCDPRVKDliQ YS TD YlFK 
AYFHPGDYPGEPPILHHSTSYNSKAJLIAFLAFLIIVTSX 7 VLLW 
LYRIYDLHKKRSCKLIJKOOHLVBRDDRKQLKNVKPrHADXIiItKT 
YKRK 1 ADBGRLFLAEFQS I PRVPSKFPIKBARKPPHQNKHRYVP 
ILPYDYBRVRXtSBINGDAGSNYXNASYIOOFKBPRKyiAAQGPR 
DBTVDDFWRM 2 W&QKATVXVWVTRCEBGMRNKCAEYWPSMKEGT 

RAFGBCCCKDLTKHKRCP\DYIIQKIiNIVNKKBKATGRBVTHIQ « | 

FTSHPDH^BDPHLIJiKIiRRKVNAFSNPPSGPIVVHCSAGVGR 
TGTYIGinAMLEGLKAKNKVDVYGYVVItLRRORCLMVQVKAQYl f 

blHQALVEYNQPGBTBVNX-SSmPYLHHMKXRDPPSEPSPLKAR * 

FQRLPS YRSWRTQHIGNQE \ENKSKNRNSNVIPYDYNRVPLKHE i > 

LBMSKKSKHD8DBSSDDiX3DSEBPSKYINASFIMfiYWKP\BVMI j i 

AAQG PI JCSTT QD PWQMI PQ RJCVKVTVML.TKLKHQDQE I CAQYWG , 

HffiQTYODlBVDI^DTDKSSTYTLKVFELRHSKRKDSRTVYQrQ < 

YTHHSVEQLPAEPKELISHIQVVKQKLPOKNSSKGNKKHKSTPL ] 

LIBCRDGSQQlGIFCALUnilaBSABTRBVVDlFQWKALSICARP 

QWSTFBQYQFLYDVIASTYPAQNGQVIKHBHQEDiCIBPDNHVT) 

KVlCQQANCVNPLGAPHKLPRAJCSQABGSBPTSGTBGPBaGVNCP 

ASPAUtfQGS . 

UAWGCKIJUlLRFTYGTQTRVSLACiPGQYKLVHTEiVATOGNVfBTI 

PEBDI^VQBNNBOAAHDLTELHVTMHHAIiLQRVDVVVAPCQGIiR 

PTVDVIX3DI,VHDFIiPVXTYALHXDBZ>SERBBQRZOBIRKYTSFP 

VFFFKVPXljGSBIIDSSTRRHESBRSPLYRQLIDLGrrLSSSHHH 

C^PG(H 3 TKAQSMLVEQSBKLRHL 6 TPSHQVLQTRI*VDAIUCAIiH 

LVBCHCLDI PSwQAFDMQRDLQITPKRLSYTSKKBNBLYRSLMH 

lASRRQBBMRDMIVRTLITrMKBELUSriATCniBFKDVlVpBNGBP 

VGTRBI KCCLRQ IQELI ISRLKQAVANKLI CGVDYLRES PVQTL 

ERCLQSLBKSQDVSVHITSNYLKQIIHAAYHVBVTFHSGSSVTR 

MLKEQIKQIIQRITWVSPPAITLERKRKVAQBAIRSLSASKLAX 

SICSQPRTRIJ^SSHBAFAASLRQIiBAGHSGRLKKTBDLKLRVRK 

DOAPRLARLSIjESRSLQGVTiLHRJCP KtGQKLGRGQYGWYLCD£i 

WGGHFPCALKSVVPPDKKJWNDlALBHfYMRSLPKHERItVDLKG 

SVIDYNYGGGSSlAVU^IMBRltHR 0 LYTGLKA 0 LTl»BTRIjQIAL 

DVVBGIRFIjHSQGLVHRDIKLKlfVLlJJKQItRAKITDLGPCKPRA : ; 

MMSGSXV(rrPIHMAPELPTOKYiai 3 VDVYAFGXLFWyiCSGSVK 
LPBAPERCASBOSiLnNNVRRGAR PHRLP VFDERCHQLMKACWDG 
DPLKRPULGIVQPMIiQGIPfi?RLCXS\NSEQPNRGIiDDST 

GRKHSAAMBJtPNGAADGL3RtfPHGLGU.T.f.I<iTJ»PPSTl^QDRL ] 

dappppaaplprwsgpigvswglraaaaxggafprggrwrrsap 

G\EDEECGR\hlDPVAKLANNTHQHVFDDI»RGSVBLSWVGDSTGV 

ILVLTTPHVPLVlMTPGQSKLYRSgDYGKWPKPITDIJMMTPlR 



THPGHAIGPBKSGKWLTABVSGGSROGRrPRSSDFAKNFVQTD 

LPFHPLTQJ^SPQNSDYLLALSTENGLHVSKHPGGKWKBIHKA 

VCLAKPJGSDNTIPPITXAlfGSCKADLGALfiljliRTSDLGKSFKTI 

GVKXYSFtnxSGRPLFASVMADnnTRRXHVSTDQGDTWSMAQLP 

KSLDHHLYTTTOGETDPTNVTSLHGVY1TSVLSBIIHEIQTPI1TP 
DQGGPWTHLRKPHNSECDAIAKNKHBCSljHrHASVSISQKLMVP 
MAP^EPMAVaiVXAHOSVGDAISVMVPDVYISDDGGYSWTKML 
BGPHYYTILDSGQIIVAIBHSSRPXRVXKrST D BG Q C M /l’ y TPT 
RDPXYFTGlASBPOARSMNXSrWGPTBSFLTSQWVSTriDFKDI 
LB£NCKEKJ>YnHIaAHSTE7PEX>YEDGCXLGTKEQPI*RL5RSSVC 
QNGRDYVVTKQPSICLCSLKDPIjCDFGYYfiPSNnSKCVEQPHLK 
GHDLKP C3, YQREBHLTTNGYRU PG0KOQGGVNPVREVKDLKKK 
CTSNFIiSPKKQSlSKSHSVPIIlArVGIJILVTVVAGVLIV^KYVC 
GGRFLVHLY5VLQQH\AKA\NGVIxnmAIJ)TASminC3<mmnS 
DEDLLB 



ATRPPRGSSWCRQFSKTASAAPGRSNKIiRI PVRKALVGLSKSPK 
GCVRTTATAASHLI E W V DGQSVK V BP GTJ VL OACHKVtagjX PR 
PCTHBRI*SVAG»CRMCLVEIBHAPKVVAACiU4PVMKGWNI LTNS 
EKSK3MBGVMRPUJ«raPLDCPiaDQGGKCI)LQDQSlWG3TOR 
S&FbBGERAVBDKKlGPLVKriNTRCXQCTBCIRPASBIAGVDD 
LGTTGBGKDMQVGTYIBKKPHSBLSGWIIDICPVGAI/PSKP'YAP 
TRRPWBTOKTBSIPVMDAVGg WXW STRTOBVMRXIiPHMHBDIM 
KEWIGDKTRPAYTOLKRQRIjTBPMVRNBRGLLTYTSMKDAIjSRV 
AGMLQ3 POGKDVTAlAGGIiVnAKALVALKDLLHRVDS DTlfCTBB 
VPPTACS^TDLRStrybLNTTEAGVBEADVVItLVGTNPRKKAPLP 
JCIRUUCSHIiHNDIiKVAIjIGSPVDLTYTYDHLGDSPKILqDXASG 
SKPFSQVLKRAXKPMVVIjGSSALQRNDGAAILAAVSSIAQKIRM 
TSGVTGDWKVMHILHRIASQVAA1J)LGYKPGVKAIRKNPPKVLF 
LUGADGGCITRQDLPXDCFiryQGHHGDVGAPIADVXLPGAAYT 
BKSATYVOTEGRAQQTKVAVTPPGIARBDWKXIRAIjSEIAGMTIj 
PYDTL\DQVRHRLKKVSPNLVRYDDIBG\ANYPQQANBIiSKLVN 
QQLLADPLVP PQLTMKDFYHTDS iSRASQrrHAKCVKAVTEGAOA 
VBBPSrC 



EAHGDGC5iRRHRASGRRAGRGBP\AGLKSQGQfiAVPKRAVARGG 

RO\YSAAIALLEPAGSBlADDLSXX.TSNRAACyLXBGa«CSGCIQ 

DCMRALHLBPFSMKPIXRRAMAYBTLBQYGKAYVDYKTVIiQIDC 

GI^QLAHDSVNRLSRXIXKLDGPNWEBJCLSLIPAVPASVPIjQAWH 

PAKEMISKQAGDSSGHRQQGITDKKTPKAIiKKKQNQCVNDKHYK 

DAI»SKXSHX3jiKIWNlCBCAiyTNRAIX^rLKLCOFEKAKODCDQAL 

QLADGNVK7^FYRRAIAHianiKNYQKSIiIDIJnCVILIiDPSIIKAK 

MKTiREVTRLLNLKDJCmPFNKKKKRRiaKiaRVNEGKKKPGRPA 

GHVSTGCLASBBGGKSSRSPKDPKKLPIAKPNNAYHFGQIINAL 

STRKDKBACAHLIJ^TAPKDLPMFLSHJCLBC2)TFLLLIQSLKNN 

lxekdpslvyqhllylskaerfkmmltliskgqkelxeqlfkdl 

SDTPNNHFTLEDIQALKRQYEL 



66VQ4ACTYV31JXjOKKQRSV3FLASGLMRVSTGPELRIiHHSyVL 
TCTVGRRICBIjLVGIjFTKGDTSSKHVHPFS PGPCFLLCDLARVG 
SSPKIKVSPFYQN\QTSTQHSCTVFVHQRCSLVGPFQVTVFTMY 
PHHSLRSISRFSSG 



rrvarpgtaefakarrtvrrgrarrdlagaerkagvsergdsgr 

RRPNPS IPSAAAGHSHIQI PPGLTBLLQGYTVEVLRQQPPDLVB 
PAVBYFXRLREARAPASVLPAATPROSLGHPPPBPGPmVAnAK 
GMESBEDRDLBVPVPSRFMJUnrSVCAETYNPDKBEEDTCPRVI 
HPKTDBQRCRIXJKACKDXIiliFKMLOQJEQLSQVliDAHFBRIVKAD 
BHVXDCX33DGtiNFYVIBRGTYDILVTKW^TRSVGQYlJHRGSFG 
BIALMYNTP -U^TIVATSEGSIiffGXJJRVTFRRI IVKHHAKKRKM 
FRSPIBSVPIiLKSLEVSEHMKrVDVXGBKXYKR/DGBSIITQGB 
KVADSFYII3SGBVSIURSRTKSNKDGGNQBVB]^ARC3IK3QYF 
GBIJUjVTOKPRAAaAyAVGDVKajVMDVQAFERLLGPCMDIMKR 
NISHYKBQLVKMFOSSVDLGNIX3Q 
5986




484 


DAHKSTS LTFHHKLKGJOIRC31RRGIAHPKNHLS PQQGGATPQV P 
SPCCKFDSPRGPPPPRLGLLGALHAEDGVHGSPPVPSGPPMBED 
GLRHTPKSPLDPDSGXjLSCTIiPHGPGGQSGPSGERSIAPPDAS r 
I>ISNVCSIGDHVAQHLPQGSI)I/31AKKAKRPGKK\AGQHSPLRB 
EHVTCVQSILDBPLQy\VGSLlPLSIlJK V VSKUSDI FQQBFSTP 
SRJ03LVIX^IQSYQimPGNAHVRG PTWayXKHVLTMDDDGTLYG 
QHWLNDQVHNMYGDLVMDTVPBlC\VHPPN8FFV\DKLBTKIjyBG 
VKRHTKNVDlFHKHLLLIPIHLBVHWSLISVDVHRHrXTYFDSQ 
RTUVRRCPKHXAKYLQAEAVKKDHLDFIlQGfflDGXFBHIiVABQKN 
DSDCGAFVIOYCKHLMiSQPPSPTQQOMPKIJiRQryKBLCHCRI* 
TV 


5987 


1806 


484 


DAWKSTSLTPHWKLHGHHRGHRHGIJVHPKNHLSPQQGQATPQVP 

SPOCRFDSPRGPPPPRLGLlilAIilABDCVRGSPPVPSGPPMKBD 

GLRSfJTPKSPLDPDSGLLSCTLPlfOPGGQSGPKGBRSIAPPnASI 

LISHVCSIGDHVTIQBZiFQGSZUXjnAKroSRPGBKVAGQHSPLRB 

EHVTCVQSILDBFLQT\YGSIiIPLSTDEVVBKLEDIFQQEFSTP 

SRKGLVLQLlQSYQlMPiaJAMVRGPRVAYKRHVLlWOLGTLYG 

GmiHDQVMHMYGDL VMDTVPEK\VHPPNS PPY\DKLRTKGYDG 

VKRWTKNVDI FHKKLULIPIHLBVHWSIJSV^^ 

RTLMRRCFKH IAKYLOAHAVKKDRLDPHQGWKGYFKMNVARQNN 

DSDOGAPVLQ YCgHLRLSQPPSPTQQDHPKIiHRQI YKRLCHTKT* 

TV 


5988


1293


410


FKKYFLSPliOIiTiKS.SHSRDRlHHLVMgTiTATHHbVFfWFTCTFQ ' 
RZiDCIYLNAG IMPNPQIjMI JCALLFG1JPS\ABGIiLTQ(3)K1TADC3 
|jQBVFBTDVPGHyiT..IHKIJPLIXmi3NPSQI>lWTSSRNARKBW 
FSLHDPQHSKOKEPYSSSKYATDLLSVALHRMFNQQGLYSNVAC 
PGTWiCNLrrYGILPPP IWTLLMPAXLLIiRFFWJAFTIjTP'YNGTB 
ALWLFHQKPRSLHPLIKYI^JVTTOFGRIIYIMTQKMDLDBDTAB 
KFYQKLLBIiKKH I RVTXQKTDNQARL3GS CL 


5989


194


2610


AMD PPQHSQHVX^QUTQQRgiiGI i U3X^r?VVDGVHFKAH KAVLA 
ACSBYFK>n4FVDQKDVVHLDISNAAGbGQV|jBPMYTAKLSI*SPE 
HVnDVL\AVATFLQKQDI ITACHAUCSLAKPATSPGGHAKALAT 
EGGDKEAKBKKVATSTLS RLEQAGRSTP I GPSRDLKHBRGGQAQ 
SAASriAKQTKKADAPRKPPPVKUCPDPTSG54AAAKAKAAl*SBSS 
BQEMBVEPARKGKBEQKBOSEQEKBGAGPABVKEEGSQLHKGHA 
PHHJHNBBSAGTDSGQHLGSRARGLRSGTYOTRTBSKAYGSVIH 
KCBDCGKBPnrrGHPBCRH I RIHTGEKPFSCBECSXAFSDPAACK 
AHSKTESPUCPYGCBEOGKSTRI*ISLUiIiRKKRHSGBA8YRCED 
OGKLiFTTSGHliERHQIfVHSGBKPYQCDYOGREPSDPTSRMRHLB 
THDTDKEHXCPHCDXKFNOVGin>KAHLRXHlADGPLICCREOG^2 
PTTSGNLIOOIIjRIHSGBKPYVCIHCXJRQPADPGALORHVRIHTG 
HXPOQCVMOGKAPTOASSLIAHVRGJfTGKKPlfVCBROGKRFVQS 
SQIiANHIBBHDtTIRPHltCSVCSiCAPVHVGDLSKHI I IHTGBKPT 
W!DKOGRGFNRVIHIIiH5nVKTVHQGKBGIlCIl£PEEGSBVSVVT 
VDDMVILATEAIAATAVTQLTVVPVG&AVTftDSTHVIJOUSISKA 
VKQVQHEDPNTHXLYACDSCGDKFIjDAHStiAGHVRIHTAQiALVM 
FQTllADFYQQTfGPGGTHPAGQVIjOAGKLVFRPIlDGAKGQPAIAK 
T5PTAPECPPPAB 


5990


2


4700


fgpgpd^2gahcsgwgsrsoapygtijgavsggbqvllhekagd 
SGFVSLSRLGPSI idJKDLKMBKLWbQDBTLIiGTMgSYMDAStilS 
LIKDFGSGWJEVEM3LPDPSHDF3PPSFLKTSSPKLPSWRPPRSR 
PRWGQSPPPQQRSDGBBBKBVASFSGQUAGKLDNCVSSIPDPP 
MHIACPKBBDKATAAKMAVPAAGDES I5SI£KLVRAKHPYCLPN 
LTHLAS LKDKLQEQ PDDLTl^EGCWLBIVGQAATRGD DLKI PV 
VVS0VSPGPRPVUXDSLSTSSAI^U«PT1»ESBTRAAVPKVTL 
C3HK8GLSLH3ERKLDSACUjKPREVVBPVVPKBPQHPPAHAAP 
GSQSARKGRKKKSKEQPAACVEGYASRLRSSSBGQSTVGTEVTS 
QVDHLQKQPQEELQKESGPLQGKGKPRAWARAHAAALHHSSPKN 
LBRBAGQSSPAKEGPM)LYP1CIADTIC)THPIPTHLSI»VDSAGAS 
PMPVDSVBADPTAVGPVLAGPVPVDPGLVDIASTSSSLVBPLPA 
BPVLIHPVLADSAAVDPAVVPXSDNLPPVDAVPSGPAPVDIALV 








DPVPHDLTPVDPVIjVKSRPTDPRRGAVSSALGG 3APQI*LVESKS 
LDPPCTIIPKVKBVVDSLKIKSGTSAriHBARPRPLSLSEYRHR 
BQQRQABTHKRS PQP PTGKWPS LPRTPTGLADI PCLVI PPAPAK 
KTAIiQRSPBTPLHICLVPVUPSPASPSPHPPVSKPVASSPTBQV 
PSQEHPIjLARPSPPVQSVSPAVPTPPSMSAALPFPAGGLGMPPS 
LPPPPLQPPSI^PLSMGPVLPDPFTHYAPLPSWPCYPHVS PSGYP 
d*PP?PTVPLV3STPGAYAVPPTCSVPWAPPPAPV3PYSSTCTT 
GPIiSWGPGPQHAPFHSTVP PPPLPPASIGHAVPQPKHKSEGTPA 

gppbnvlpi*shappls£glpctgapqteptkvevkpvpasphpk 

EKVSJUAJQSPQHKMiACVSABGVTVBKPASBMJCPBTQBTRPRB 

kpplpatkavptprqstvpbuavhparlrklsflptprtqgsb 

OWQ&FISBXGIBASDLSSLLEQPBXSBAKKBCPPPAPADSLAV 
(^SGGVDIPQKKRPLDRLQAPHLANVAGLTPPATPPHQI*WKPLA 
AVSLLAKAK3PKSTAQBGTLKPB0^3AKHPAAVRKJKGVH3PS 
HVHVGSGDHDYC\VRSRTPPKK\MPAIiLI pkvgsrhnvkrhqdi 
TI KPVLSIiGPAAPPPPCIAASRKPlJJHRTSSEOADPSAPCLAPS 
SliLSPEASPCEHDMNTRTPPBPSARQRSKRCYRKRCRSASPSSO 






* 


HKRW2RSSCSSSGRSRRCSSSSSSSSSSSSSSSSSSSSSSRSRS 
PSPHfiHSDHRfifiYSSYESHDHYQRQHVIjQKHRAIEEERWFIGK 
IPGHKraSHUCQRFSVPGKIKECnHPHVQGMYGFVTYRYAEB 
APAAIBSGHK1A(^I1BQPFD1 jCPGGRRQPCKRSYSDLI>SNREDP 
D P APVKSKPDSLD FDTLLKQAQKHLRR 


5991


334 


1573 


RbSSHFSQCS PS I YC \TKPDKQGNVTS PHRKJCi'KliifQKIiGliQAR " 
DLRFQHVWSITVRNNRI lMFMKYIJCAVITPBaJ»IIDYRHLNLK 
QKLFR3LPSQLSGEGQLVTYPIiPFEFRAIBM»LQYWINTLQGKl» 
SIbQPI*ILBTLS3ADGDPKHS5VDRS KLHXUiQNGKSLSBLETZDI 
. KIPKBS.IIiRILDKBKLLKBLCVSKWSDPQVPKKSSAGIDHT^KBM 
ELItLBNVyRIiADDItSNAARBEiRVZiIDDSQSIJFINIiDSHRlfVMN 
RLOTjQLTMaTFSLSLFGLMGVAFGMHLBSSLKKWmiFWLITGI 
MFMGSGIiIHRIlIjI*SFLGR/l*ARSSIASYraiKDMVHGGIVBGIj 


5992


2


509


Afl P D PRLVCX5VS6SG P PGGRQGQ ATKNRPLR P WNGAME KLRR Vli 
SGQDDEEQGLTAQDSQINL/SEVLDASSI*SFNTRLKNFAI CFVC 
G\TETSIIJGTGLUILPGGICLPAVPyTL®JIiAALASTCFL*5GPVK 
QLKKMFBATRLLATIVMLlXTTFTLCAAIJiTOnCKGIjAVLFCIljQ 
PI^MTWYSLSyiPYAHDAVX ROC 8 SI 1 L 6 


5993


1650


594


ABGI/33KAVWAGIiGHflGfiHKEAGGATGLAIiQrVGCKI*PSAJFCKPGS 






BGVPSTAIHB ISLIdCKLKHPNlVRIiLDVVHNBRKLYLVFBFLSQ 
DI»KECYMDSTPGSEIiPX»HI»IKSYLFQLLQGVSFCHSHftVIHRDIJC 
PQNIXIKELGAZ KLADFGLARAFGVPLRTVTHHVVTIjWYBAPBI 
LLATRFYTTAVDIWSIGCIPAEMVTBKAiPPGDSXBIDQXLPRI 
PRNLGTPSBDTWPGVTQLPDyiOGSPPIOiTRKBLKBIVPHtB PBO 
RDLLMQbDQYDPSQRrrAKTALAHPYF3SPEPSPAARQYVliQRF 
RH 


5994


394


1934


AGBVQLHVHlRGJmQPQ/KAAAIIDLbPDFBPOSHPRSCXWPL 
PRPEIAMQPSKPPEVBPDLGBKVHTEGRSBPILLPSRIiPBPAGG 
PQPGIDGAVTGPRKGGSRHHAWGNOSYAKLISOAIBSAPBKRLT 
LAQ ITENMVRTVPYFKDKGDSNSSAGWKN5 IRHNLSU1S KF IKV 
HNBATOKSSWWMLHPBGGK3GBAPRRHAASMDS3SKLLRGRSKA 
PKKKPSGLPAPPBGAIPTSPVGHFAKWSGSPCSHNREEADMIfTT 
FRPRSSSMASSVSTRLSPI 1 RPESBVI 1 ABEIPASVSSYAGGVPPT 
LNBGLEI*LDGLNLTSSHSLLSRSGLSGPSU3HPGVTGPLHTYSS 
SLPSPAEGPLSAGBGCPSSSQAI^ALI/rSDTPPPPADVLMTQVD 
PILSOAPnXLLGGLPSSSKLAlCVntCPKPLHAPGPSSLVPTL 
SMIAPPPVMASAPIPKALGTPVbTPPTEAASQDRMPQDLDLDMY 
meklecdmdhiisdlmdegegldfnfepdp 




2 

1 



2437 


i RPPGPGPASGAmxrTRARGSAAFVPPUPRPPSRGARRRRRIjPGR 
: GVAAUOIGPGSAPGIiPRGRABRSARGSGRCTSREERGAAAAAAA 
AEMMKBLHSLXDPXRROKLLKARPXtGLGVSKGPIiNSESSNQSL 

csvgsi»s DKKVETPKKKONDQRNR krkakp ybtsqgxgtpsgmk 








I6DYPKRRVEQPLYGLDGSAAKBATBBQSALPTU43VMIAKPRI. 
DTBQLAQRGAGLCPTFVSAQQNSPSSTGSGNTEHSCSSQKQISI 
QHRQT\Q3DLT1EK1 SALEHS BKSDLEKKEGRTDDLLRANCIILR 

rqiVdegqkmlhkykVerlsrcpdnbprnflibkbkqkkmacbd 
KS> 5Q93RIJU/IHFTTVRHGASFTEQWTUGYAPQIJLI KQQERIKSQ 
REBIHRQRXKLAKRZPPAMGOAPPATNEQKQRiCSiCEHGAENBTIi 
TLABVHBQRB 1 PKLRLGHLSKEKAHIQAELHR LKHVBKLHXREL 
E^UHHEDHSQPKDHPTLHDBYLLLHULGRGGPSEVYKAPDLTBQ 
ryvavkihqi^knvrijkkkenyhkhacrbyrihkkldbcprxvkl 

YDYPSUJYDSPCTVLBYCEC2TOLDFYLK0HKLKSBKEARSIIMQ 
IVHALKYLSEI KPPI IHYDLKPCSJXLLVNGTAQGB I KITDPGLS 
KMIOT)SYBWVIX3HBI»TSQGAGTYirfIiPPKCFVVGICBPP3CISinC 
VDVWSVGVIPYQa*YGaKPPGHNOSOQDILQENTILKATKVQPP 
PKPVVTPEAXAFXHRCLAyHKHDRIDVQQLAJCDPYLLPRIRESV 
STSSPAGAAXASTSOASHNSSSN 




1612


981 


IX)0ACLLG1^TLBPGILBFDPSWIGSHTQR/SHVSKRSRPGCB 

LFSIWFGSIVHBGYIiSSASBGEBFCrfKRHPHACSYGVAVGVL 

AFLTCLLYLALDVYPPQISSV1CDRJCK\AVI»SGHPVVSGEPHPAA 

FWAFLWFTGDSCYL\AWJHQV5KPKDNPLJIBGTDASPGRPSPPS 

FFSimiSLTAAXAVRRFKDLSFQRBYSTLPP\ASAQP 


5997 


1612 


981 


IXXJACLLGIllLTLRPGILHPDPSitiaSWTaR/SWVSWHSRPGCB^' 
LPSIVVK;SlVNE(miiEA8BGBBPCIVrNRNMiACSYGVAVaVL 
APLTCLLYLALDVYPPQISSVKDHXK\AVL3<fflPWSGBPHPAA 
FVAFIiWFTGIX3CYL\ANQWQVSKPKDNPI2?EGTDASPGRPSPFS 
FF3IPTWSLTAALAVRRPKDLS FQKKYSTLFP VASAQP 


5998


1612


981


IXXJACTJiGLPILTLKFGILEFDPSWIGSW-i-UH/SWVSWRSRPGCk 
LFSI WFGS IVNEGYLHSASBGHBFCI YNRHPHACSYGVAVGVL 
AFLTCLI 1 YLAI 1 DVYFPQ i s s vkdrxk\ avlsgsp wsgb phpaa 
FMAFLWFTGDSC fL\ANQWQVS KPKDNPLN KGTDAS PGRP S PFS 

pfsiftwsltaalavrrpkdlspqkbystlppVasaqp 


5999 2



1790 1 



RPPMBKARRGGDGVPRGPVLHXVWGFHHKKGCQVEFSYPPLIP 
GIXJHDSimiPKENKYI^FLALPBGAHNYQKDTVPFHLPPRNGMG 
ATVFGI3CTR\QIEAKAIJCVRQADIT»ffrVQKSVCVLSKLPLYO 
XjkjAKLOLlTUAYFKKKDPSQlSILKBLYHHMHSSLGGASLJSGS 
OVYLGI^PRDLVLHPPHKDLrLPKLILLKXKVLPYl SPVHKLVG 
ALMTVLSLPPGMIBHGLSDCSQYRPRXSMSBDGGLQEaNPCADD 
FVSASTADVSKTNLGTIREVHAG31HGEDAAHKTSBPLFQVHDSS 
EBQEPNDlUQYLKPPSRPSPDSSBSDHBTLDPSVliSDPHIiKERE 
QM3DQTOLFP BCDSVPSBSLPITVQPQAOTGQWLI PGLISGLE 
EDQTGMPIAJFTKGYIiCLPYMAIjOOHHIJjSDVTVRGFVAflATNI 
LFHQQlMLSDAIVBVKBALIQIHDPBLRiCUMfPTTADLRFADYL 
VRHVTENRDDV PLDGTGWEGGDBHIRAQPAVY IHALLAATLQL V 

lfrivia/akkic^vmvtt\srnvvqtgk\avgqsvggzvfs\sak 

1A\KSSWLSTPTT3TSQ3LTBPPDBKP 


6000


101 


1561 


TBPCRTAENCTATMSBZ^NKHSLBSSLRQLKCHFTWNLMEGENSL 
DDFKDKVFfRTEPQNRBFKATMCNIiLAYIiKHLKGCSfBAALECtdl 
KAKBLIQQEHADOASIRSLVTWCTXAlfVYYEMGRIiSlJVQIYVDK 
VKHVt3KPSSPYRl3SPE^OiEGNTRI.KrGGNQNKRAKVCFEK 
ALKKKPKNPBFTSGIJVIASYRLEMWPPSQNAIDPIJIQAIRIjNPD 
NQYLKVLLALXIJIKMREBGRKKGBGEK\LVEKALI?KAPG\VTDV 
LRBAA\KPYTK3KDEPDKAIBLLKKALHYIP\NNAYLHCQIGCCY 
RAKVPQVMNIJlBHG)fYGKRlQjliELXC3HAVAHLKBCADEANDNLPR 
VC$ 1 lASiaAIADQYKDAHYYFQKEFS KKLTPVAKQLLHLRYGN 

fqlyqnxcbdkaihhfijsgvbgcnqksrekbkmkdklqkiakmrl 

SKNGADSBAIHVIAFLQEU9BKMQ0ADBDSBRGLESGSLIPSAS 

SHNGB 


6001 


176 



1038 



APAH£PSR(221HTHXHTPIIHTPRCTMABSHLQSSLITASQPF8 1 
HliOmiX^GVLEGKEU^lQBLQOARKKAGLELSPEMKTFVD 
QYGQimDGKiaiVEIiAHVLPTEENPLLLFRCQQX*KSCE\EPHKT 
VfRKYD7DHSGPlETKELKNPIiKDIiLEKANXTVlJD7 KLAEYTDLH 








lklfdgnhdgklbltbharllpvqbnpiiLrfqgikhcqkbfbka 

PBLYDQIXJNGYIDKNBLDAIjLKDLCEKNKQDrjJIBNXTTYKKiri 

KALSDGGKLYTtTDLALILCAQBN 


6002 


977 


81 


LAPPQGGLRI PPRTPLSHSRPPPSHHAPHPSPLPhPPADLHPHS 
SMAQRHDLLBIJXX)LTRDRVVVVSimKNLCKQSGI^Rim3SLDP 
EDLPIiYKEKJLEVYFS PGHFAHGSDRRMVELEDLFQJIFPRTPHSV 
EIKGKBBELIftEQ/VbVRRYDHBBITIHASBKSGVMKKCKAANP 
EMPLSPTISRGFWVLLSYYI/SLljPFIPIPBKFFFCriiPHIINHT 
YPPFSCSCUJQLIAWSKWLIMWCSLIRHLBKHGVQWTWCIjIJB 
ESDPEAAPSVQATGVITDYPTALSHYLDMHGPAAinrS 


6003 



140 



4098 



OKLRAFRGflRRIilCKRICDYKSFDDSBSVDGNaPSSAASAFKVP 
APKTSGNPANSARKPGSAGGPEVGAGASEBGGAGAVDSDDFI KA 
KTDVPSIQIYSSREIJEKTr2^KIRBII»SDDKHDf?DQHAHA£jaCIR 
SLLVAGAAQYDCFFQH1 jR1<IjIXZALKZ»SAEDIASQVVHBAjCZTVA. 
HLSTVIJGNKFDiCSAEAXV PTLFNLVPNSAKVMATSGCAAXRFI J 
RHTHVPRLI PLITSNC7S KSVPVRRRSFBFLDliLLQBWQTBSLK 
HHAAVhVBTI KKBIHDADAEARVBARICTIWGIjRNHFPGKAETLY 
HSLEPSYQKSbQTYLKSSGSVASLPQSDRSSSSSQBSLBRPFSS 
JW3TARPSTVAGRVSAGSS RASSIiPGSLQBSRSDIDVHAAAGAK 
AHHAAG0SVRSGRIXiAGAhNAGSYASLEDTSDKIJX3TASEDGRV 

raxlsaplagmoiakadssgbsrtemvsqsqpqsrsgspgrvlt 

TEAL3TVSSGVQRVLVHSASAQKHCXIPRSQGCSRBASPSRL3V 
ARSSRIPRPSVSQGCSREASRE8SRDTEPVHSPQPIA3RHHSRS 
TGALYAPBVYGASGPGTOISQSSHbSSSVSAMRVLNTGSDVBBA 
VADAIxLLGDrRTKKKPARRRYRSYOOiSDDDANSDASSACSBRS 
YS9RBGSiPTYMR(n'\BDVVAJSVZiBRC!?VSSHirSERKEGI(LGliQN 
LLKHQRTLSRVKLKRLCE IFTRMFADPHGKRVPSMFI*ETI*VDFT 
QVHKDDLQDWLFVLLTQLIJCKRCAinJiGSVQAKVQKALDVTRES 
FPNDLQFKIIrfiRPTVDQTQTPSIJCVnKVAXl^IBTLAKQMDPGD 
PINSSKTRLAVSRVITWTTBPKSSDVRKAAQSVLISbPBLNTPB 
PTMI*LGALPKTP0IXJATKLLHKHI*RNTCa?GTQSSMG3PLTRPTP 
RSPAHWSS PLTSPTNTSQKTI^PSAFDYDTEHMMSBDrYSSLRG 
VTRAIQHP3F!^BDMNHPLKIUmKDD(aOSMOGGPG\MSDPRA 
GODATDSS(377d i \DNKASLLHSMFimSPRSHDYBPYBYSDSIS 
PPNKSALKBAMFDDDADQPPDDLSLDHSDLVABLLKRTiSHHNRR 
VEBRK1ALYELKKLTQSBS FSVWDKHPKTIIiLLLLBTLGDKBPT 
IRALALKVIiRB I IJRH0 PARFTMYABbTVMKTLRAHXDPHKEVVR 
SAKKAA3 V\i^TSI\SPBQCIKVIXIPIIQTADYPIHIaAAIKMQT 
rVIERV5KETLNIiI>LPKIMPGI»IQGYI3®SESSVRKACVFCLVAV 
HAVIGDRLKPHLSQLTGSKMIUUILYI KRAOTGSGGADPTTDVS 

cos 


6004


140 



4099 



GKLRAFROIRRLICXRICDYXSFDDKBSVDGhRPSSAASAFKVP 
APKTSGlfPAJtSARBCPGSAGGPICVGBGAS KRGGAGAVDKDDPI KA 
PTOVPSIQIYSSRKI£BTLNBCIRKILSDDKHEWDQRAHAIiKKIR 
SLLVAGAAQYIXFFQHL!UiI«DGAI*KLSAXDtASQVVRBACITVA 
HMTVLGNKFDHtyiBAIVPTI^NLVFNSAXVMATSOCAAIRF X I 
RHIHVPRLIPLITSBCTSKSVPVRRRSFBPLDLIiDQBfytyglSIiH 
RHRAVLVBTIKKGIHnAnABARVBARKTYIWIiRNHFPGKAKTtiY 
NSLBPSYQKSLCrrYl*KSSG$VASIiPQSDRSSBSSQB3mRPFSS 
E3JSTANPSTVAGRVSAGSSKASS LPGSbQRSKSDI DVNAAAGAK 
AHHAAGQSVRSGRLGAGAI/W^YASLHDTSDKLDGTASHDGRV 
RAKLSAPLAGMGNAKADSRGSSRTKMVSQSQPGSRSGSPG^VLT 
TTALSTVSSGVQRVIiVNSASAQKRSKXPllSQGCSRBASPSRbSV 
AR6SRIPRP5VSQGC3RBA9RBSSROTSPVRSPQPLASRHHSRS 
TGALYAPBVTPQASGPGYGISQSSRLSSSVSAMRVIiHTGSDVKKA 
V7UWiLLODrRTKKKPARRRYBSYGMHSI>DI)AHSimSSACSERS 
YSSRNGSI PTYMRCrr\KDV\ABVIJJRCASSI3WSKRKBGLIiGIiQN 
UjKNORTLSRVBLKRWSIFTRMFADPHGKRVPSMPLHTLVDPI 
Q VHKDDIX>DWI^VXJ,TQ1*IjXKI^ADI»I/?S V QAKVQKALIJVTRKS 
PPHDLQFNILMRFTVDQTQTPSLKVKVAILKYIETLAKQMDPGD 
PIBSSETRIAVBRVITWITBPKSSDVRKAAQSVLIfiLFBbNTPB 
beginning nucleotide location corresponding to first amino acid residue of amino acid sequence

nucleotide location corresponding to first amino acid residue of amino acid sequence







RSPANWSSPLTSPTHTSQNTLSPSAPDYDTEMMNSEDIYSSLRO 
VTBAIQNPSFRSQKDMKBPLKRDSKKDDGDSKOGGPa\MSDPRA 
GGOATDSSQTAL\DNKASIiHSMPTTC3PRS30YNPYNYSDSIS 
PFNKSALKKAHFDDDADQPPDDI^LOHSDLVfiBLIJCKLSNHKKR 
VKERKIALTKLMXLTQKSSFSVWDEHFKn LLLLLCTiGDKHPT 
IRAXjALKVLRBIIllHQPARFlWYAELTVMrTIKAHKDPHKBWR 

SAKKAASV\LATSI\SPBQCIKVLCPI I QTADYPIHLAAIKMQT 
KVIKRVSKBTTLNLIJjPSXMPGIdQGYlIWSBSSVRKACVFCLVAV 
HAV1GDBLKPHLSQLTGS KMKLLNLYI ECBAQTGSGGAD PTTDV3 
GQS 

HSSGRRQBQX/JQFPGRKRKGMASGliGSPSPCSAGSKKBDMnALL 
MSSLPPPHPBHKKDPKKDLSBTBTPKLKKXKKPKXPRDPKIPKS 
KRQKBERMLLCRQIiClDSSGBGPKFVBBBBEVALRaDSEGSDYTP 
GmCKKKLGPKKSKKS)CSKRSBEBEEEH)DDDDDSKBPKSSAOLL 
KDWGMKDI DHV F SEK DYRTLTNYKAFSQFVRPLIAAKHPKIAVS 
KKMMVLGAKWR3PSTKNPFKDSSGASVAAAAAAAVAVVESHVTA 
TBVAPPPPPVBVPmKAKTKEG®9PNARRKPRS3PKVPDAKXPX 
PKXVAPI»KIKLGGPG3KRKR3SSKDD£II»DVBSDPDDASIHSySV 
SDGST8RS3RSRKKLRTTKKKKKGSSBVTAVPGY Hi'UH Qby C3S V 
CQQGGHI IbCDTCTRATHWVCIiPPPWKKAPKOKW SCPH CTKBSl 
qWKAKKDNSBGKBILRBVG(33LKKKDE)HHHBFCRVCKDGGBUjC 
CDTCPSSYHIHQiTPPLPBIPNGHWLCPRCTCPALKGKVQKILI 
HKHGQPPSPTPVPRPPnADPNTPS PKPLEGRPBRQPPVKBQGHS 
YHHCSUVSBLQLKLHC\QVMFRNYQRKHDMDBPPSt>DPGat>HBK 
S\RKRKHKDPKPAEMB8RPYRVOIKPBW\MMIHRDjilKSVDKKO 
BVHYldBHRDIaPYDQASUfiSBDVBIQDYDZaFKQSyNNBRBUIRG 
KBGRPGKKLKKVKUUOjBRPPBTPTVDPTVKYKRQPEYUIATGG 
TI^PYQMKGIiNHLRFSWAQGTDTILADSMGLOKTVQXAVFIiySL 
XKBGHSKEPFLVSAPLSTI IN\HEREPEr^APDMYV\VTYVGOK 
DSRAIIRBICBPS\FEI2fAIRGGKKASRMKXEASVKFHVI*LTSYE 
LITlDMAXLGSIIJHACLIVDKAHRLKNfilQSICFFRVLilGySlXlHK 

llltctpixjnhleblfhllnfltperfhnlbgflbefadiaked 

QIKKLHDMU;\PHMI J ffRLKADVFKNMP3KTBLIVVRVELSPH\Q 
KKYYX\YlXlISByLKAIJf\ARGGG!fQV3IJjHVVMDIjKKCC3SrHpy 
LPPVAAHBAPKMPWGHYPGSAIiIRASGKIiLLIOKMLKHIiKKQGH 
BVLIPSQMTKMLDLI£DFIiEHEGY{CTSRXOGOITGNMSQBAXDR 
FimPGAQQPCPLLSTRAGCMXNLATADTVIiynsianJPMJDIQ 
APSRAHRIGQHKKVMIYRFVTRASVBERITOVAimiMLTHLVV 
RPGLGSKTGSMSBQKLDDILKPGJTEELPKDKAUDGGGaJHKEGBD 
CSVIHYDDKAXBHIXDHNQIJETKDTBLQGMNBTLSSFKVAQYVV 
RKKBKGEBEBVBRBtXKQKBSVDPDYWEKIiLRHHYEQQQBDLAR 
HIX5KQKRIRRQVHY1*DGSQBDRDHQDBQ$0HQ3DYSVASBBGDB 
DPDKRS BAPRRPSRKGLRNDKDKPLP P LLAHVGGNIBVLGFNAR 
QRKAFIMAIMRYGWPPODAFTTQtCLVItDIiRGKSEICSFKAY VSLF 

mrhlcbpgadgabtptudgvprbqlsrqhvltbigvmslirkkvq 

bpehvngrwsmpblaevehskkmsqpgspspktptpstpgdtqp 

ktppapvppaecgibikbkslkkkbsibgbkbvkstapbtaibct 

QAPAPASKDHKVWBPPBGBEKVBKAHVKBRTBBPHBTRPHGK9 

aadvbkvbbksaidltpiwkdkebkkbkbbkkbvmh»*gbtpk 

DUTOBKQKKMIKQRPMFmADGGFTKIiHSUfC^BKRAATVTraT 
YKIWHHRHDYWLLAGIINHGYARWODIONDPRXAIl^PFKGai 
NR£3IFliEIKNKPLARRPKLLEUALVIBEQLRRAAYLKMSEDPSH 
PSMAUTTRPABVBCLAESHOHLS KBS MACNKPANAVLHKVLKQL 
BBliLSDMKADVTBI^ATXARIPPVAVIlLQHSBRNXLSRlAHRAP 
BPTPQOVAQQO 



DNOPLHirrVHRHBPPVEAEPIRLLABWKDVVWDKPSSIpVHPC 
GRFRHNTVT FXIi3KKHQIJCBlJlPLHRIl)RLTSGVLMFAXTAAVS 
BRIHBQVRDRQLEKETVCRVEGKFPTBKVTCKBPILWSyKVGV 
CEVDPRGKPCBTVPQRLSyNGQSSVVHCHPLTCRTBQrRVHLQP 
LGHPILKDPIYNSVASrcPSRGRGGyiP^'iTJSRLLRDLVABHOAr 
QSLDVLDLCEGDLS PGLTnSTAPSSKLGKDDIiSKLAAAA\QKNS
BVZ\BARPQHI»DriAIASEKAVETDVMBQ\llQT\TLCRVPA6ATC 
SLAPRPCDVPTCPTL 



HKLGQVKyvrrDKTGTLTBNEXQPHBCSIKQMKYQBmGRIiVPB 
GPTPDGS£t3fLSYLSSLSHIjMHI*SHLTTSSS FRT3PHHBTBLIK 
HHDLFFKAVBLCHTVQ IHNV0TDCTt3)GPWQSNI*APSQI*EYYAS 
S PDKKALVBAAARIG1VKIC24SKBTMBVKTI/5KLEHY KLLH IlM 
FDSDRRRMSVTVQftPSGSKLI*FAKEAKSSIXiPKCIGGBIKKTRI 
HVDSFAZJDGLHTLCIAYBICPISKE7RBOKRr7EASTALQQR\B 
HK3 jAAVFOPXRKDLILIX 5AT3WKDRLQDKVRBTIKALRMAG1 KV 
WVLTGDKHBTAVSVSLSQGHPHR1HITXLELIHQSSDSBCABQLR 
QLARBlTBDHVIQl{GLVVDGT8I>SLALREHB£a>FHBVCRNCSAV 
LCCHMAPLQKAKVIRLI KI3PBKP1TIAVGDGANDVSM1QEAHV 
GlOIMOKEGRQAARKSirXAIARFKFIjSKLIiFVEUHFYYXRXATL 
VQY PFYKNVCFI TPQFLTQFYCLFSQQTL'yDSVYLTLY \HI CFT 
SLPILIYS3JI£QHVDFHV1 iQ^KPT1>Y]ID1SKZ3RU£IRTFLYWT 






VTTVTV?CMALETHFMWlKHLVTWGSIIFYFVFSLPYGOILHPP 

LGSQNMYPVFlQIJ*SSGSiWP7VIII>fWTCI*FLDIIKKVFDRHL 

HPTSTEKAQLTBTNAGIKOiDSHCCFPBGBAACASVGRKLERVI 

raCSPTHXSRSWaASDPPYTNISSILTTLSlJffiSSTC 



A3VRBAGARRG PGRALPAGATAVP PPSARRRRRCPAPEHAG PAR 
ASRPSQKTHPQId’YNNLGSLRKRRKTVKKILLSDIGLBYCKEaiK 
DPKQFBPNDFYIiKHTTWEDVGIlfflQPSLTKNQDYETKPFCCSACP 
FSSKFFSAYKSHPMWHSKDPBSH1UJTCPYCTPHADKRTLBTH 
I OFHAPNASAPSSSLSTPKDXNKNBGLKPKQADSVBQAVYYCK 
KCTYHDPLYBXVRICHrYRKHFXlHVAAPYXAKMBFCSliNGAVPIX; 
SHRHEBSSIHCKRCLFMPKSYERLVQHVIEDHBRIOYQVTftMIG 
HTNWVPRSKFLMLlAPKPQDKKSMGLPPRIG3LASGNV\RSIiP 
SQQMVNRI^IPKPHLHSTGVWMSSVHlittQiaJYGVKSVGGGYSV 

gqsmrlglgqcupvsipqqsqsvkqllpsokjrsyglgskqrso 

APARYSIiQSANASSLSSGQLKSPSLSOSOASHVLGQSSSKPAAA 
ATT3PPPGNTSSTQKWKICTIC3JKLFPBHVYSVHFBKEHKABKVP 
AVWTYXMKIHNPTS KCLYCSRYI^PTDTLiHHML IHGLSCPY CRS 
TFNDVSHKAARKRMVUZDBStGPfCrDSTLGPDEiTLQQOSHTtflH 
LLVTTYNLREAPAESVAYHAQNHPPVPPKPQPKVQBKADIPVKS 
S PQAAVP YKXDVGKXLCPLCPS IXjKGPISEALAHHLRHHHQVXQ 
TVHPVEKKLTYKCJ HC!LGVYTSN»TTAST rn^VHCRGVGKTQIJ 
GQDiGlPlPSRmQSPSIAFVKRT Y B QM B FPLUCK P KT iT>DTy » DS P 
SFFKEKPEEPVVIJUJ?PKrai\KDOSYEAJtKSPLTKYYT\KQPYP 
TRRBXBKLAASLWV\ffK\SDIASHPSNKRKKCVRDCE!CYKPGVL 
LGFNMKBLMICVICHBKDFDABGLFEHHDKKDSRVHASKTADKKLH 

VLKVXPBDASBSBBICLDQKBDGSKYKTIHLTBBPTKLMHNASDS 

BVDQDDVVEHKDGMPSESGPGSCXJVSDFBDNTCKMKPGTlfSDB 

SSOSBDARSSKPAAKKKATMQGDRBQUCtfKKS^YGKVEGFHSKD 

QSQWKHASBSDBRLSNPQIBHQEBKSTIDSBDGSQFnNMTDGVABP 

MHSSLAGVKLSSOQft. 



CHGLQHLTPPRKLNLSLQG*BPH*AA*QAVaSKKKSIC*GSPSC 

hlvlgvlvpvas^qsshsagpaqsafr^tgtgsgtpkaaeqsgyw 

EAYTXJ5HQHWNMPPIQRPPLVMB3RRIMCXIKCBK3S*VSDSVTQG 

RAVAGBQASQRRTVPTAGGGBCIGAKSVRASVPTGKOPGVMGtiL 

NGKRGGCFKSGYIjFGPIVXGKIQSLBAKVPIiPVNGQTGKRASPG 

KCKIHIVimvC*SHffi*DHFLAAAFLRIJSTllS*VAPGSWQDHA 

vlqkkvoasvrcrgfhsvdtapagfkahsppglqgeptttsvsl. 

FVIJlPQIX^VpyvEGQLVTVLGLVVPOSIRHTPVHHTQLFLHP 
I * KLGAIiDVAPt»HLLTLVCSSFNVAYG TOOJGGTTLHQIiPAEVN 
AVTRGSAVQRRPS ITI S3IHVDTB3(XiBIiHDVM^^ 
DPFVVGIMIFHIdDDPlflQISLSPQRRV*BQCQGVKPriSQPVP 
RPLRVGLLQVGPLVRGGGRKVWGRGKROiHDIjIiFPWRWGLSHRT 
RI«d*HGGDSt3HVWIVIiCRLGSLVGGlJGTDKLI»S?PGGR* tl I IG 








1 * * KGRLSGBWGCSJIjGSGBLPQVSIGIGVS ivhigqgdhsvlgg 
AGLVERGALBATGQGVBALVOQI* ld>VGPAGALGXCDGAALFQGP 
GRVGOLPABGLGVCXTLV7^3KRMHDGHELGGAKHPWQALHGAAI 
CGV0GAXLLXALSQYFUQ5G*BL9CAHOQ* PVKKRQHRWEG* TR 
R*HGLTIHCPH # LI # GRVCCRLVTLRWCGLL5VH0VYGT*I>gI« 
GSPPGRLWP* mSQKRPHtMCQWKPRIAVPSWKCRHSRKHVBO 
TWRYCTPZiIMLL^GAWbGGAACGGQQOSPLSTfiQACTDPGQAAP 
LPPFDGACSPETQSCRTHVCPXAHRQLIAYTED 


6010 



1 


3533 



IMPCXJSSRULHGOflHPifBPVSDLSYFDClKSVMSNSKVIiGKS^ 
AGISQNAK1GDUPAFGBCVG1ASKALGGLTXAAAQAAYLVGIFD 
PNSQAGHQGbVDPIQPASAHQAlQHACC^LVDPGSSPSQVLSMV 
YIVAKtfTSAGCXaCKIASSICrANPVAKBHFVQSAKBVAHSTANL 
VKTI KALDGDPS KDKRNKCRIATAPL IKAVEHLTAF7VSNPKFV3 
IPAQISSBGSQAQBPXLVBAKFNLBSSSYliIBXARSIAIHPKDP 
PTWSVLRGHSHTVSDSIRSLXTS I RDKAPGQHECDYSIDGIKRC 
IRDIBQASI»AAVSQ51iATaDDISVEAIiQB0I»TSVVQBIGHLIDP 
IAXAARGBWlOLGHKI3TQliABYFBPX>irAflVOV7^S tOXDHQQQM 
TVUJQTKTIJIESMiQKLYAAKBGGGHPKAQHTHDArrEAAQLMK 
KAVIJDIMVTT/O^AASECVOTiVGGKVnAIAKAMSXLDEGlTPPBPRG 
TFVD YQTXVVKYS KAIAVTAQEKMTKSVTNPBKDGGIASGMTSD 
YGHLftFQGQHAAftTRgPggIGPQXRTRVgDDCTGCIPl>VQPWB\ 
AXfiVCPTDSYTKSHZjIBCASAVTBKVSIjVLSAbQkSIBGTQACX 
T!AATAVGGIXAZHjOTTlMPAYAOTIjHABtC3BTPADHRBHIldCTA 
K^VBDTKIJ^VSGAASTPDKLAOAAQSSAAYITOIAEVVKLGAA 
S IX5S DDPETQ WLXNA.1 KDV7VKAL3D L ISATKBAASKFVDDPSM 
YQIJCGAAKVMVTCnrrSIiIJCTVKRVKIJEAl'RGTHAIiKATIKCIKQ 
BLTVFQS KDVPKKTSS P BBS IRMTKG ITMATAKAVAAGNSCRjQE 
DVXATMJl^RKnVSDMLTACRQASPHPDVSDBVRTRALRFGTBC 

tu^u>ij j £hvi,vi^qkptpsucqqx*aafskrvagavtei,ioaa 

EAMRGTEWVDPBDPTVIABTEIjLGAAAS X EAAAKKLEQLKPRAK 
PKQADBTLDPKBQILEAAKSIAAAT3ALVKSA3AAQRKLVAQGK 
VOSrPAHAADDGQWSQGI,! SJVARMVAAATSSIXZEAANASVQGHA 
SKEXMSSAKCVMSTAQIdiVACKVKADQDSBAMRRIXJAAGNAV 
KRASISfLVRAAQKAAFGKADDDDVW KTKPVGGIAQI IAAQEEM 
LKKERELBHARKKLAQIPQQQYKFLPTRLBBDBG 


6011 


44£ 


1835 


UXJPAMRXSPGJ^DCLWAW I LLX^TLTGRS YGQPS LQDKLKDHT~ 

TVFTXILPRLLDGYDHRLRPGLGggVTOVrotDIPVTSPSPVSDH 

DMBTTIDVFFRQSifKDKIUjKFIQDPMTVIiRliNHLMASKIWTPDTP 

FHHQKKSVAHlRnNPHKLIiXXT > 3DGrn«LYlMRLTVS\ABCPMAP 

GRDFPH\DVVHACPJLXFGSryAirraABVVYBWTRBPARSVVVABD 

GSRIJ?QYDIiI^50TVDSGIVQSSTCSYVV>TrTHPHLKRKIGYPVI 

QTYLPCIWTV1 LSQVS FWIiHltKSVPART\rPGVTTVLTMTTI*S IS 

ARNSXiPKVAYATAMDH FXAVCYAFVFSALIKP7VTVNYFTKXGYA 

WDGKSVVPEKPKKVKDPI^CKXHHTYAPTATSSTPHIJVRGDPGIA 

TIAKSftTISPKbVKPBTKPPBPKKPPHSVSKIDRLSRXAPPItLP 

GIFNLVYHATYLNREPQUOIPTPHQ 


6012 


351 



5013 



P AKLPQS FAIWHXKLYDWRLG PWNQCQPVI S KSIiKXPLECI KGK
EGiaVRBIACIQKDHDIPABDXCCBYPEPKPLbBOACLIPCQQD
CIVSBPSAWSBCSKT(35SGLQHRTRHVVAPPQFGGSGCPNIiTBP
Q VCQSSPCEAEKLRYS LHVG PWSTC^PHSRQVRQ ARRRGKNKB 

rekdrskgvkdpearelxkkkrhhnrqnrqkhkywdiqigyqtr 

evmcthktgkrmjlsfcqqbklpmtpqscvitkecqvskwsbws 

PCS ICTCHDMVSPAGTRVRTRTrRQFPIGSBKBCPB PEEKS PCLS 
QCTGVVPCATYGWHTTBIfIBCRVDPLLSQQDKRRQttrZAliCBGG 
IQTREVYCWQAHBHXJiSQLSTHKNKKASKPHniJaX^TCPXPNTT 
QI/]HIPCPTKCBVSPWSAHGPCTYBNCNDQQGKRGFKLRKREtIT 
NEPTGGSGVTtSKTHLIiHAIPCKKPACYDWKAVRLGDCBPDitGK 
I BCGPGTXTOKVVCINSrXSBVDRQLCRimiPPI PVACDAPCPKD 
CVLSTWSTWSSCSOTCSGKnEGRQIHARSnAKAGEEGOIRCP 
NSSAIXJSVRSCNEHPCTVYHWQTGPWQQCIEDTSVSSPNTTTTW
NGBASCSVGMQTOKVICVRVNVGQVGPKKCPBSLRPBTVaPCLI* 
PC&KDCXVTP YSDWTSCPS \S CKBGDSS 1 R KQ3RHHVT IQLPAN

ggrdctopi.ykbkaceapqacqsyr>Akthkh\hrcqVlvp\ws 

V&QDS P\GAQEGCG PGHQARAI TOUCQDGGQAGIHECLQYnGPV 
PMiTQAJDQIPCD£H)CQLT3WSKPSSCKlIDQGAVRTRKRTLVQKS 
KKKKKCKKSHLYPLIBTQYCPCDKraAQPVGHilSDCtLPEGKVK 
VIXQdKVQCTIKEOG<XlYHYQAKACYDQHGRLVBTSRCZSSHGyi 
ESACIIPCPSDCKLSBWSNWSKCSKSOGSGVXVRSKWLRBKP'YH 
SJHPCPKLDaVHQftQVYKWPCHSIXKQYUfVTBPWS I CKVTPV 
KMRESniEGVQTRKVRCKQNTAIXn^KHVBDYLCDPBBMFLGSR 
VCKLPCPBDCVTSESiGPWTQCVLPQJQSS PRQRSADP IRQPADB 
GRSCPNAVEKEPCSLHKnCXHYDYMVTDWSTGQLSBHAVOG34GI 
KTRXLDCVHSDOKSVDLKY CKAI/JLEKlfKQKJ?TSC34VECPVNCQ 
LSDWS PWSKCSOTCGLTGKfaRfiMVTQPPQGDGEPCPSI/»!DQS 
KPCPVKPCYRWQYGQWSPCQVQEAQCGEGTRTRNXSCVVSDGSA 
DDPSKWDRBPCADIBI*I iDGHKNMVLBBSCfiOPCPGDCYIiKDW 
SSlISLCQLTCVNGEDLGPGGIQVESRPVa IQELBHQHLCPBQHL 
ETKSCnX3QCyKYKHMASAWKGS3HTVWCQJlSDGINVTGGCLVM 






CTLX PVWLPTHEDKSGtDVETSRAVHPTQPSSNPAGRGRTVTFLQ 
PFGPIX3 RI»KTSVVYGVAAGAPVLIiIPXVSMIYIACKKPKKPQRRQ 
NNRLKPLTLAYDGDMJM 




EAV7U3GMKKSRKNLPKGPDTDCFDKDSFMKEDFDVDHPV5DCKK 
RVQLEEUU}DLBLYYECLl*KTAMVBLIHKDYADF\VNl*STl}LVt94 
DKALNQLSVPlilQLRSBVLS LOSS VSKSXRA VDRRMS KQRDIEIK 
(^CVLRLIQVIRSVBKIBKILNSOSSKBTSALRASSPLLTGQI 
IiERIA i rKFKQLQPH7VOQSK\GMPLLDKVRPRLAGITARL0QSLB 
GLI«LBGliQfrSI>VDXXRHCLRTYATI DKTRDtARAL VGQVLVKP Y I 
DBV1 IKQPVESHPNGIKJVWYNKLLEFVPHHCRliIiRBVTGGAISS 
BKfafTVPGYDFLVNSVFfPQIVQGIiEKKLPS LFNPGE7PDAPHKKY 
TISMDFVRRLBSQCGSOASVXRXAAHPAXUSFBKKNNLPVYFQI 
RPRBIAGSLBAALTDVLEDAPABSPYCLLnSHRTWSSLRRCWSD 
EHFLPLLVHRLWRLHSGRFHASYSVFV\N\BLSI*RPJLSNESPKK 
IKKPLVTGSKEPS rTQGNTKDQGSGPSBTICPVVK ISRTQLVYW 
ADU)KLQEQLPBLLKIIKPKLBM3GPKNP98ISAAXxEDS03SPS 
ACVPSI^KIICT>LSDSCPGFXJtSALEVPRLYRRTNKBVPTTA3 
SYVDSAIiKPLPQUjSGHKDKLKOAl IQQWLEOTLSESTHKYYKT 
VSTTVLWSVKKMKKSLKRIiKOARKTrPANPVGPSGGMSDDDKIiUj 
OLAIiDVEYLGBQ TQKLGLQASDXKS FSALAKLVAAAKBQATAKQ 
P 



ABGCAKRJiGTKPWKLSMSWBSGAOPGI/5SQO<DI,VWSAWYGKC 
VKGKGSI^PLSAHGIWAMZjSRAEWDQVTVYLPCDDHKLQRYALN 
RITVWRSRSC9fKCtPljAVA31A0LZ&CiC[iIiDVTGGIiGTDELRLIiY 
GMALV3LPVHLX3BRKTKFAKVPLKCLAQKVNI PDH I VD LRHKIjT 
KKKKPHXNIX3tEGCYFVLIJWLQKTYWCR0tiEKSXiRKTWSLEEFR 
EGIKEBDQSKDKnTWDDITKQKPBPQDDGKSTKSnVKADGDSK 
GSEEVDSHCKKAbSHXELYERARRLLVSYKKBQPTVLBKFRYTiP 
KM KAHNNPSPRVKCVXAELKGVTCEMREAVLDAFIJ1DGFLVPT 
FBQIAAUJIEYERNVDLHDVLVPKPFSQFWQPIARGLHS®fFTQ 
ALLBRMLSBliPAliQISQlRPrYXLRWTVBli lVAS T K T G RHAHRP 
SAGQWBARRSWRIiPNCSASLDHPRMVE3aZ3SPC5MPQLLRII 
P\KAMGQGIXJDE\KQKKLI*R I CS IYTQSGEKSLVOEGSEAS PIG 
KSPYTLDSLYWSVXPASSS FGSBAKAQQQ8EQGSVNDVKEKBKE 

QESPTAKNAHI*LftQKRGRIiQGSJ\HQVSSBDVRWDTFP\lJGRMPR 
SRPKTPABUfLBZrYDTHVI FWTKPVL\EQRLEPSTCK\TDTLGL 
\ SOGVGS\OICSHSSS£NPRQAFIiLBARGSIiH\Gl.\ KTGLQLP 
VKGKGSLPLSJiHGIWAMLSRAEKUQVTVYLPCDDHKLQRYAI/*
HrTVWRSRSGNKLPLAVASTADLIRCHiLDVTGGLGTDBrJlIiLV' 
GKALVRPVHLIfiBRKTKFAKVPIiKCLROKVHlPDHIVDIiRHKbX 
HKKWPEDIDg tR OC Y P V mWLOKXYWCHQLSHSIiRBTWBliBgPR 
EGIBBKDQEEDKHIVVODITBQKPBPQPDGSSTBSnVKADGDSK 
GSEKVDSHCXKMjSHXKLyBRARBLLVSYKKBQFtVLKKBTRYLP 
KAI KMQ3WPS PHVBCVIABIJtGVTCEfniB&VLDAPlJJDGFLVPT 
PKQLAAIXJIRYBKMVDLinJVLVPKPFSQFKQPLLRGLHSCJNPTQ 
A1ABRMI»SKU»AIZJ1SGI RBrr^liRJmraLrvwmCTGRHARRF 
SACQRSARB6HRI»PNCSAS]d>VfPEUfVB5CU33PCHASPQU<RlX 
P\KAHGQGUH>B\BQEKLIAI CSIYTQSGBNSLVOBGSKASPIG 
ES PYTUSLYWSVKPASSSPGSKRKRQQQBBQGSVMPVKB BBKB 
BKBVIiPDQVKKKBBHDDQBBBKBDKDDBDDBBKORMEVGPPSTG 
0BSPTABmRLLR0KRGALQGSAWQVSSKDVRHOTPP\I/3RMPR 
SHPRTPAKIML8HYDTHVIF&nrKPVI>\EQIU^7STCK\TDTLGI* 
\SCGVGS \GHCSNSSSSHFRGAFLLKARGSL« VGL\KTGW3XiF 



UFALQTOKPALPMP PVBBIjDVMFSBLVDKIiDLTDKHREAblFAIiP 
AHKKWQ IYCS KKKDQB ENKDATSWPBFYIDQLHS MAARKS LLAI, 
EKBEKKKRSKTI BSUGAI^TKPMRFVTRFXDLDGLSCIIjNPLK 
TKDYKTSBSRIltrSiaGCXKAUOCJSQGHAHVLAHSBSIIIVlAQ 
SLSTBNXKTKVAVLBI LGAVCLVPGGHKKVIXJAMLHYQK3RSBR 
TR FQTl»TNI>f J)^CSTGflyRDKVS LKTAIMS FXNAVLSQGAGVBSL 

dfelhlryb\ flmujihpvmdkijuchbnstijjrhldffbhlrhb 

DELE FAKRPBLVHIDTKSArQKFBLTRKRIjTHS RAYPKFttS ILH 
HCIX^yKRSa>nVQYHLI»IiDRIIOQIVI<»?DKGQDPDSTPLEN 
F1TTKNVVRMLVNKNHVKQWKE0ABKMRKEHNKLQQKLBKKHRKC 
DAKTQEKEBMMQTLNKMKBK1*EKBTTKHKQVKQ<JVADLXAQLHB 
LSRRAVCASIPGGPSPGAPGGPFPSSVPGSLIiPPPPPPPLPGGM 
LPPPPPPLPPGGPPPPPGPPPU5AIMPPPGAPMGLALKKFSIPO 
PTNALKSFNWSKI*PENKLBGTVWTEIDI3TKVPKI LDLEDLERTP 
SAYQEQQDFPVHSNSKQKEADAXDI/rLSSKLKVKBLSVlDGRHA 
QglCN lLLSimCLS MIMBXKRAXUMPBQgPLPKMtLBQiLLKPVPK 

KPAKHVaBVKPKVKAIESOSSBVFR3aAIjK0IiLEVVJlAFt3fYHW 

KGQRGBAYOPKISSIilKIAiyrKSSIPKHXTLIfiYLmVHHKYP 

SVUn^BBUtDIPQAAICVHMTKLDKEISTLRSGLKAVETELEYQ 

KSQPPQPCa>KFVSVVSQFITVASFSFSDVEDLIAKAKDLPTKAV 

kbfgebagkioppsffgifdoflqavskakqbnenmrkkkbebe 

RHARMBAQLKEQREREaKHRKAKESSKESGEFDDLVSALRSGEV 

PDKPLSKLKRNMCRITWQMTIIS3RBHPITKI11P 



TlSQSGGlWtRRBAVWFEVVHMDFSRIfiMYSpPQCVPEKTGTTr 
ALSSSYSSKVIiDFBTEHKIiD PVPDS PEMSRRSI*RIATT7VCTLGD 
GEAVGADSGTSSAVSLKNSAARTTKQRHSTOKSAFS INHVSRQV 
TSSGVSYGGTVSLQDAVTRRPPVU3BSWIRElQT!rVDHFWGXJH)D 
GDUCGt5HKAAIQCTGDVGftGAArCTNGPFCfiNC5OILS3RK0VLT 
AHPAAPGPVSEVYSHDIUiQKCDDCKnKRHLnAHPGHAGTLWHIW 
ACAGYFMjQILHRIGAVGQAVSRTAWSALWLAVVAPGKAASGVP 
WWLGIGVyyiQPVTLlSWLSVFLLT’RCLRMICKFLVL LtXPL PlJJ/? 
L3LRGQG\NFFSPtPVLNWASMHRTQRVI>DPQ£IVPKPTrSIUjKQ 
PW3GDSBAFPWHWMSGVEQQVASI*SGQCHHHGOJLRELTn*IX)K 

loarvdgmegctjuspsasvrdavgqpprbtdpwaphqhhbvrms 

HI»EDILGXLJZBKSEAIQKHLEQTKQKTISAV)3EQIiI»PXVEHLQI- 

ELDQI^ELSSWHHVKTGCETVnAVQESVDVQVREMVKLLFSED 

QQGGSLBQLLQRPSSQFVSKCDIjQWLRDLQXiQIIiRSVTHHVSV 

TKQI»PTSEAVV3RV3BAG7kSOI?BAQARAlVNSALKLY8QDKTG 



VyiQPOlYP^CHAPTOGSCXJYLVVRX^SMMIHPAAFTLEHI PKTL 
SPTQJISSAPKDPAVYGl»SNEYQKEGQI»ICQPTYDQ i DGBStX)M®’ 
QMOTPIHJ'TAPQIVELRIPSWItaiPEYTCLYRFRVHGBPVtt 
PVrrSAlHSHPGlSSQIL'lHAQGQVIGTIiE^WVVNSASVAAPAPA
QSU 3 VQAVTPQlJJUffiQGQVlATLASSPLPPFVAVRK\PSTPES 
ItLKSBVQP IRPTPTVPQPAWZASPAPAAKPSASAPlPITCSfiT 
PTVSQLVHKRn'PSLDgDGIHLSBIRBKRKHFKIHRLSIiGLTQT 
QVGOALTATEGPATSOSAI CRFEKLEI TPKSAQKLKPVLEKHLH 
EABLRHQBGQONI/tKPVGGBPSKKRKRRTSFTPOAIHAI/IAYFE 
OfPLPTGQBUSIAICEI^YIXRgWHVyFQtRRQTItKHTSKLMVP 
OIP 



BAIQPBV3ICam2IKIT)TTCKPIASTrQySRAVFDGHrrrXIiPtf 
AHTKPVVTLTSYHBDISHRLDAVHTIIiAMAEHljQTlflBAiKSGI 
QGKI PAHQIAEI^HliJCUDBVIBDTRYTl^PbTBGKTiMVTVliDTOI 
EKLRSRSLSQIHKAAVRMRSKATDVKSTLAKIBDWIDKU<QLTB 
KPQHSKPDI IIWMIJHSBKRIAYARIPftEGVLYSTSGEHASGKYC 
GICTQTIPLIOrPQKKHHGPKVPVKIJIVHIWLGIiSAVBKKFNSPAB 
(rrPTVFAEMYBHQAliMPGBCHGTSQLVGIUlKFSDVTOKlKLKRB? 
FliPPKBWHWEGEMlVDPHRSLLTKADBGlfrBPTDHVYQNBSRYP 
GGDHKPABDXYTDRBG®KAASPSKI*TCPPGWBHBDnRltSYDIHR 
AV DBB C H B YQI TIPPDHKPKSWVAft g KW YHT HRRHRLVttKRKKP 
LTQTA 3 STWaAM^IXJDQB®lKYASI*IGWKFKNXCJRS 3 I)TFRRR 
RWHRXMAPSSTH 3 AAAI FKLBGALGADTTBDGDBKSLEKQKHSA 
TrVPGAOTPIVSQrPDBDYIYHLRCYVYQRRSriJAIiDKDSFSDP 

yahicflhrsxtibi ihstiuptodqtiipdbvbivgbpqtvlq 

HPPXVZMBI>FnHDQfVGKDBFU 3 lSIP 6 PVVKUT 3 S 4 DITPKUj 9 f 
BPVMNGDKAOCnJVLVTAKIiIlRGKDGSNliPILPPQRAPiCLYMVP 
(X 31 RPVVQLT?iIKII»flWQliaWKI?FQMAS ITSPSLVVBOGGBRV 
BSWIJQSLKKTPfTPPSSVIiFMXVFIiPKEKLYftPPLVIlCVIDHRQ 
FGBKPWGQCTI BRLDRPRCDPYAfiKKDIVPQLKASIiLSAPPCR 
D IV IBM KDTKPLIA 5 KCLSSMSTAbS KXAS PATVHLT E KKKKI V 
DWWSKFXASSQBHKkCt^riQKBySICLKXYNCBLBtlVaBPBGLT 
DPSDTPKr.tRGKSDKNEDPSWGRFIOGSFiayPiPDDPSVPAPP 
TOFim,PDSVPQBCTVRI YIVRGlBLQPGDHNGbCDPy I KITLO 
EOCVIB\DRDmPNri^PVFGRMYBLSCYLPOHXDLKISVYDn? 
TPTRDKKVGBT I IDbENPP\l>SRPO\ SHCG \ I PBEYCVSGVHTW 
RD 3 IJi\PTX 3 \LliaHVARFKGPPQPIbSBDGSRIRYGGRDYSU>B 
PBJWKXmpHLSAPHBRLALiiXLRTQGIiVPBBVBTRTmSTPOP 
HIS\RYYLRVirWHTK 0 VIIJ) 8 KS 3 TGBBWSDIYVKGHIPGHBB 
NRQKTDVHXR 3 IjDGEQJPHWHFVPPPDYLPAEQI»CIV 7 VKKEHFW 
SIIXyrKFRIPPR\LIIQIR\lRTOKPS\U)iymiPPRrrLTCfiHTI 
HFLQKSPCGHC/RGL»IPDIJC 7 VMNPIilCAK!IA 3 LFEQKSMKOHH 
PCYRRKDGARVMAQSCVKHTIiBILNBKBADKRPAGKJGtRDBPNMKP 
IGUDLPHRPgTSPliWPTMPC K IM K F r VKRRPKWVlIGIliPLLlIJi 
I^VAVIJUYSIiPHYLSMKIVKPHV 



ElVIQPEVSIGNYiafKFDTTCXPLASTTQySRAVFDGNYYYYLJPH 
A J T KPV V TLTSYWKDISHRimVtmiAMaRRLQTHIKAiKgG I 
QGKI pMffQIABI^KLIDBVIBDTRYTliPLTBGKANVTVLDTQI 
RKL{£KSI^IBEAAVSMRSBATDVKSTZABIBDHLDXLMQLTB 
BPQHSMPDI I IWHIRGEKRIAYARI PAHCVLYSTSOKNASGKYC 
GKTQTI PIiK 3 fPQBKHNWKVPVBI*RVNIWIiGI*SAVBKKKNS PAB 
GTF 1 VEABMYBSQALMFOKKGTSGLVGRHKPSDVTGKIICLKREP 
FLPPKl^BWBGRVfTVDPERBLIfTBADAGBTBPTDEVyQNKSRYP 
GGDWKPABDTYTDASnSXAAS PSELTCPPGWEtfEDDAWS YDIHR 
AVDKW 5 WBYGITIPPDRKPKSWVAABKMlfHTHRRRRL.VRKRKKD 
LTDTASSTAGJVMKKLQDQSGWBYASLIGWKFHWKQHSS DTFRRR 
RNRRKMAPSBTHGAAAIPKLEGALGAD 7 TEDGDBKSI®CQKHSA 
TTVFG^NTPIVSCHPDRDYIYHLRCYWQARNXdiALDKDSFSDP 
YAHICFLHHSKrrBIIHSTLNPTWDOTIIPDBVBIYCJEPQTVLQ 
NPPKVmffiliFDNDQVGKDKFLGRS IFSPWKLNSEMDITPKLLW 
HPVMNGDKAOGPVLVTAgLlLRGKDGSHIJXLPPQRAPHLYMVP 
QGIRPVVQLTAXBILAJR 3 LHHMiQIPQMASlTSPSI*VVEOGGBHV’ 
BSWI KNtiKKTPHpPSSVLFIOCVyLPKHBljYHPPLVIKVIKIRQ 
pghkpwgqctikrldrprcdpyagkbdivpqi»kasixsappcr 



DivxKM8irrKPUJisra*ss>trEAr^ra<AS patvhltkxbkb i v 

DWWSKFYASSGEHBKCXJQYXQKGySKLKIYHCKLEIfVAETBGLT 

DPSDTPKLYEGKSDEHEDPSVVGEFKGSFRIYPIiPDDPSVPAPP 

RQPRKLPDSVPQ2CTVRXYIVBGl*EIiQPQIffiNGLCDPYIKrTLG 

KKVTBVDRDHnpHTLHPVPQWfYBLSCYLPQEKDbKISVYIJYD 

TFTRDKHVtTBnXDLBHPF\IiSRFG\3HCG\lPKBYCVSGVNTW 

RJD$Ul\PTQ\I(LC2SVARFKBFPQPXLSBDGSHlRYGGROYSLD5 

PEANKXLHQHLGAPBERIALHTLRTQGLVPEHVBTRTLHSTFQP 

N1S\RYYIjKVIXWKTKDVIIiDHKSITGHBII3DIYVKGWXPGNBB 

NEQKTDVHYRSLDGBGSFZOiBFVFPFDYliPAEQLCCVTkXKEHFIf 

SIIXJTHPRIPPRVLIIQIWVDHDKPSVLDDYLGPPFlTtiTCRHTI 

siplqkspggnc/hgldhipdlkajwplkaktaslfeqkshkgww 

PCYABKDGARVMAGKVBMTLBmfBKHADKHPAGFOGRDEPNMNP 

KIiDLPNRPBTSPUfPTlfPCKTWKPIWRRPKHVIIGliIjPIiIjIIiL. 

LPVAVLLYSbPKYLSMSJVKPKV 



BAIQFEVSlGEfYGNKFinTQCPIASlTQYSRAVPDiafYYYYliPH 
AHTKPVVTLY3YW8DXSHRI11AVHTLLAMABRLQTNTRAIJCSGI 
QQKI PAMQIAglJJLKLIDBVlEDTttYTLPLTHGKAHVTVIJyrQI 
RKLRSRSLSQIHEAAVBMRSBATDVKSTIABlEDWLIJKlJfQLTB 
RPQHSMPD 1 1 IWMIRGEKRLAYARI PAHQVL YSTSGEHA3GKYC 
GRTQTI^UWTQBK®3GPKVPVBIiKVHXWL0LSAVBXKPHSPAB 
GTFTVFAH4YERQAI^PGKKGT5GLVT33HKP80VTGKIKLICREP 
PIiPPKGWBKBOBWrVDPBaSLLTEADAGICrBFTOBVYaHKSRYP 
GGDVfKPABDTTTDASGDKAASPSELTCPPCTCKWKDIiAWSyDINR 
AVDKKGWKYGITIPPDHKPKSWVAJVEKMYHTHRIIRRIjVRKRKKD 
LTQTAS5TAGAMSBLQPQBGffEyA5LIGWKPRHXQR5Gim 7 RRR 
RWgtHMAPSHTHGAAAIFBdLBGaVLGA DTTKDGPB KSLBKQKHSA 
TTVFGAl/rPIVSCNFDRDYIYHIJRCYVYQAHNLLALDKMFSDp 
YAHICPLBRSidTBlIKSTliNPTRDQTIIFEPVBIYGEPQTVI^ 
NPPKVIMKLFDNDQVGKDEFI/3RSI F8PWKLNSEMDITPKUL»W 
H PVMNODKACGDYLVTAKHIjRGKDGSNLPILPP qrapnl ymvp 
QGXRPWOLTAIEXIAWGLRNMKHPGMASITSPSIjWECGGERV 
KSWI KHLKKTPflPPSSVXiPHKVPLPXBBLYMPPLVI KVXDHRQ 

VT5R kfwgqctihrlo&prcdpyagkedi vpqlbas llsappcr 

DIVlBMEDTKPLLASKCbSSKSTALSKHASPATVHLTSKEBBXV 
DWHSKFYASSGBHBKCGQYIQKDYSKbKI YNCELBNVAEFEGLT 
DFSin , KiaYIKSKHMIEDP$VVGSFKG3FRIYPLPDD?SVPAPP 
SQFRBI^DSVPQBCTVlU^YXVRGLKI^PQD^fGIXI) PYIKITLG 
KKVIE\DRI)HYXP2nijNPVFGSMYEljSCYIiPCEKI?I»KXSVYDYD 
TFTRDKKVGBTI IDLB*?PF\MRFG\SHCG\lPBBYCVSGWrW 
RDSbR\ PTQ\liM}HVARFKGPPQPIIiSEDGSRXRYGGRDYSiit>B 
FKAHKI LHQHI/iTVPBERliAMTuRTQGbVPBHVBTRTljHSTPQP 
NISXRYYLRVTrMHTKDVHjJBKSITGEEMSDXYVKGCnPGNKB 
MXORTDVHYRSLDGEGHSNWRFVFPPDYbPAEQX'CTVAKKKHFH 
S IDQTEPRIPPR\I*I IQIW\DHDKFS\JJ)OTW?FPRTLTQRHTI 
HFLQKSPGGHC/RGLDMIPDLETMHPLKAKIASbFEQKSMSGifW 
PCYAEKDGARVKAGKVBMXXjBIbNEKBADERPAGKGRDEPNMNP 
KLDljPHRPBTSFUIPtNPCKTWKPIViniRPKlHVrXIOIiLFIililLL 
iJPVAViaiYSLPNYI^MKIVKPHV 



S FYLKANCKVCPVALIQSFSHYS IVOQXLGVSLTMARKRGQLVP 
LEGIi/ lVCSGR\VFOAQKBPEPLQFIiRHANAGNLKPI«FEFVREA 
LK3VD^KARWTYPVLLVDDLSVLLSI/31 GAVAVIjDPIHYCRAT 
VCVffiLKGNWVLVHDSGDASDinSHDIIiiiGLSHQSHLIIiRAEGL 
ATCFCRDVHGQLRimRRPSQPAVHRDQSPTYQYKIQDKSVSFF 
AKGMSPAVL 



FLSFLCYPRFRCLFCLQPA1PA8HMSQI«NKIJJ«01EKSPWEBAB 
LPABLFQKKVVAS^inVLS’IXaCJHRYLVlAVHTVQtfKKGNCSK 
RLVITASQS U2NKELCI IrfdHJtf CSVPVEPGDI IHLEGDCTSDTPf 
X XDXDPGYMLYPDMLI SGTS1 ASSIRCMRRAVI*SBTFRSSOPA 
raQ^GTVIiHEVpQKArNNSPAPKKLQELAPOTIQBIRHLKEM 



DEIKQKVKDYLPSPCKHACTFMHKIITSTIJFPQMQIjSL 
PSDNS KEMSTCNIBWXPMDIBESDfSPRFGXJCGKI DVTVGVK I 
HRGYKTKYKIMPLBUrrGKKSNSIBHHSQVVI.rrLLSQBRRWJP 
BAGLLLYLEIGQMYPV?AHHIDERELIiKUQ7QM7^FSIiFHRIS ks 






SVPIVMLPKIREE7QHLjQQYHI«BYPSIAfCI£0»TIiBSQSKI]i7KKN 
RQNIWIKPASBXEKSGSCIGNLIRMEHVKIVCDGQYIiHNPQCECH 
GAXPVlHIMAGDKVTVSGEERSltFMsSSGTVKBnOITTVTCLLD 
RHLSVLPBSTLPRIiDQBEXNCDlznpt/aakSKLMBNTFVSKRLR 
Did XDFREPQFISYLSSVLDEDAKDTVAClliXGIiNKPQHQAMKK 
VTiI>SKDYTLIVGMPGTGKTTTTCTLVI£LLYA£X 5FSVLLTS¥THS 
AVDKIIIiLKIAKFiaGFLRSaVQIQKVHPAlQQPTEHHICRSKSI 
RS \LALI»BKl*YrSQLl DATTCKGIHHPIFSRXIFDFCXVD8AS Q 
ISQPICLGPLFFSRRFVLVGDHQQLPPliVIjfrSEARALGMS&SliF 
KRIiE^KRAVVQUrVQYRHStSlOMSLSNKLTTKGKLECGSDKVA 
SAVXHI^HFKDVKLHLBPYADYSiaTPHIWGVFBPNHPVCFIjWTD 
KVPAPEQVBKGGVSNVTKAIdil VFLTSXFVMGCSPSDIGI IAP 
¥RQQI*KI INDLiIiAKSIGMVBVNTVDKYUD\HDXSIVLVSPVRSW 
KDGTVGBLIiKDffSRTMVAITRAKHDiIIibGCVFSUTCSPPIiEKI* 
LHHUJSEKKI IDXtPSRBHBSLCBXLSDFQRB 



GGFPAOSDHLPPVPPliR8DIiLrmSTLyV6PHDDAFPSIiHAIiIA 
ARYGEAGBG PGWGQAHPRICLQ PPPraRTSFPPPRLPALBQGPG 
GLWWGATAVAQLLSSPAGbOGPGGSRAAVLVOQSVSYADTKLJ P 
AAOGATIiPAIiGIjRSSAQDP0AVIiGAl43RAI*SFIiKEWIlUBTVlJ\ 
GBAPTLADLAAVTAliLIiPFRYVLDPFARRIWHtfVllOfFVTCVRQ 
PHFRAVLGBVVLrSGARPIiSHQPGPRAPALPKTAAQLKKKAKKR 

bklbkpqqkokiqqqqpppgbkkpkpbkrbkhdpgvitydlptp 

PGKKKDV^PMPDSYSPRyVRAAWyptfWKQQGPFKPKYGRPNVB 
AANPRGVFMMCIPPPHVTOSUMDHALTNAIQDSLTRWHRMROE 
TTLWNPGODHAGlAIXJVVVRKKLWRBQGIiSRHQIiGRHAFLQBVW 
KWKBEXGDRIYHQLKKIASSLDWDRACFTMDPKI»$AAVTHAFVR 
LHEEGI I^TRSTRLV^WSCTLNSA1SD15VDKXKLTGR11 iIiSV?G 
YKBKVBPGVLVS J^YKVQGSDSDKRVVVAITH I KTMLGDVAVAV 
RPKDTRYQHLKGKHVIHPPliSRSliPIVFDEFVDMDPGTGAVKIX 
PAHDQNDYBVCQRHtJLRAISIMD3RGALXNVFPPPLGI»PRFEAR 
KAVLVALKKRGLPRG I EDKPMVVPLCNI^KirWVRPLLRPQWYVR 
CXIBMAQAASAAV7HGDLRII*PBRHQRTWHAK}iDNIRH \tf CMFPG 
KLHNG\HR\IPAYFVTVSDPAVPPGm>PZX»YHVSGRHBABARE 
KAAKBFGVSPDRISIvQQDBDVLDriWPSSGIiFPLSIIiGHPHOSED 
I^VFYFCHTitiETGHDIIiFFtlVARMVMLGLKI/ItsRIiPFRBVYIjHA 
IVRDAHGRJO^KSbGNVirPLDVIYGlSLQGIJWQLLHSNItDPS 
KVKKAKBGQKRDP PW3IPB0GTDAI*RFGIiCAYMSQGKDlliI»DVIf 
RlLGYRHFafKLWNATKPMJlGIiGKBFVPS PTSQPGGHBSLVDR 
W1 RSRL7^VRLSNQGFX)AYDF?AVTTAQYSPWLYKLCDVYI^ 
I<KPVUK»VI)QVAASGARQfrLYTCZJ>VGl>Sl«IiSPFMPFVTBHLPQ 
RLPRHMP0APPSIjCVTPYPEPSEC3HKDPBABAALBI*M»SrrRA 
VRP\LRADYNLHPESOPTCFLBVAD\EATGALASAV5GYVOGPG 
QAQVVVAVAEP>»LPAP\QGC^VAIiASDRCSI\HLQLOG\LLX>P 
ARKU5\lOiQ\AKRVKAQ\RQAQ\RLR\EaPA\ASl3iPVKVPL\E 
VQEADKAKLQQTEAKLRKVDRAIALlFQKML 



GPITFXJGOCARMKDMPLRlHVIJjGlArrTLVQAVDKKVIXZPRIjC 
TCBIRPHFTPRS IYKBASTVIX3miX^TFPAKIiPANT0I bLLQ 
T8HlAKIBYSTDFPVHl.TGLDI*S0niIjS3VTHXWGKKMPQliIiSV 
YiSKNKLTEI^KKCLSELStlLQBLYINHNIiLSTISPGAFIGIiHN 
IJJaHUJSNHIiGMINSKimiAIiPrJLBILHIGEHPIIRI KDMNPK 
PLrHLRSLVIAG INI>TRIPnH7lLVGLENI*B3 ISFYDHSLIKVPH 
V7dJQKVVNLKFU>U(KNPINRIRRGDFSNMLHI«KBLGlNNMPEL 
ISIDSIAVDHI»PDLRKIHR!I10IPRLSYIHPKAFFRLPKLESIjMI» 
WSHAl»SALYHtJTIBSI»PNLKEJS XHSNPIRCDCVIRWMHMHKTH 

IRPMRPDSIJ'CVDPPBFQGONVRQVHFRDMJffiiCliPIjIAPSSFP 

SMLNVBAGSYV5FHCRATA\BPQPBIYW1TPSGQKIiLPMT\I>TD 
KFYVBSEOTLDINGVTF KEGOLYCCUVniliVGADlfKSVHI KVDG
S FPQDMKGSUH OHDIQANSVLVSWKASSKTIJCSSVKWTA PVK 
TBRSEAAQSARIPSDVKVmn»THU?PSTBYKIClDIPTIYQKNR 
KKCVMVTTKBlilPPQKBTgKIWinnffLMftCLGOliLGl IGVICLIS 
CLSPBMKCDQGHSYVRNYUQKPTFAI/3BI*YPPI*nJWJEAGKKKS 
TSLKVKATVIGLPTNHS 



ggrrapgrpgrs ikdkkbbtvfrewsfspdpi*pvryydkdttk 

PISFYI^SL2ELLAWKPRLKD6FTWAI£PLW3?QPP1SSQHPST 

UiCHCMlCQ^n^Dl^IQSSVVQTPXAFrHHQCIDVFVYPSHHTV 

QAVWJRLVQIT\RPFaFDGWLINIB5lSLStWW(2IKPPPLRYLT 

TQLHKQVPGGLVMrn>SVVOSGOLKRQDKLNQHNRVFPDSCDGF 

PINYHKHRBHLHBMLGQAGSRRADinrVOVDfVPAHGHVVDQRFDT 

DKVGGGF&PRASGPVPPIXjPHPUIDIiPFPSAPQKNDSSCSSQSG 

dpvalrnecpapaklcph 



M CI j a QA gftFHGBI BDTOQWlVTDTSIfflLlW\Si(PLGGI«PgT 7 uCEQ 
IOTHNBVCAAP&AKBBTTKSIiMQKGQQNLARCPKSABTIiIIDQDZ 

msi*kkiwesvbtklnkr\kt\kleealhiaVmbphnsl\qdpih 
fTLTQABQTUJUASRPS LXUXTVLPQIDBHKVPRHRVHSHRBQI I 
KbD?artn , HLKYFSQKQDVVLIB3SIIJjISVQSRJfEKWVQRI»VERGR 
SLDQARKRAgQPHgAHSKmBWLSESBKSLDSBX^IAMDPPKIK 
TCJIJ^HIKBFQKSUjWCHSVYDTTRRTSRSIiKEKTSIiMJDHIiKLD 
DKLS3LftDKHDTI00KSVERQIiKLKSA\lJiFSGQFTDALQALlD 
HI»YRVBPQIAKDQPVHGDIDLVK»I*ICNHJCAPQKKbOKRTSSVQ 
ALXRSARHIjIBGSRDDSSWFKVQPIQSLSTIHJETVCALS iskqtr 
LKAALRQAEEFHSVVHAIiIJKWIAEAEQTLRFEGVLPDD hdalrt 
blDQHKEPMKKLKSKRABLHKATTKGOTVLAJCHPDSITTIKHH 
ITZXRARFEHVLAWAKQHQQRXASALAGI»IAKQBIj1jHALIiAWL(? 
KABTTLTDKDKBVI PQBIEBVKALIAEHQTFMBEMTRKQPDVDK 
VTKTYKRRAADPSSLQSHI PVU>RQRAf3RKRPPASSLYPSGSQT 
QI BTKNPRVHbXiVSKWQQVWIJJUjERRRKLNDALDRLKgLRBPA 
NPDiT)IWRKHYtniW03HKKSHVMI>FFRRIDKDQpDGKlTRQKFID 
OILSSKFPTSRLEMSAVBDIPURDGDOyiDYyKPVAMiHP'KKnA 
T^I*niADKIKD] 5 VTRQVAK£ 3 lCAKJlPQWBOIGB 9 SnCfRPPLGNQ 
FGDSQQLWiVRIIiHSTVMVKVGGGWMALBBFLVICNDPCRAXGRT 
NKBLREXFTLADGASQGMAAPRPRGnRRSRPSSSGASPNRSTSVS 
SOAAQAASPQ\n?ATlTPKlLHPLTRiJYGKPHLTOSKMSTPCKAA 
ECSDFPVPSABGTPIQGSKL5UjPGYljSGKnFB3<3SDSGLITTAA 
ARVBTQPAOSKKTPSRPGSRAOSRAGSRASSBSaSDASDFDZSB 
iqsvcsdvktvpqthrptphagsrpstakpskiptpqrkspask; 
LDKSSKR 



0GSSRLLRGCITIHFNBPVSD1*SYFDCXBSVMEHSICVLGESK 

AGISQMAKTGDLPAStSKCVG IASKALCGIiTE^AAQAAyLVG I PD 
PNSQA^QGLVDPIQFARAHQAlOlAOQNbVDPGSSPSQVLSAA 
TXVAKBTSAIX3JA£3lIASSKTAOTVAHRHPVQSAKBVAItST3VNI* 
VICT t KMiDGDFSEDNRNKCRIAEAPLIKAVE3SI»XAFASNP bpvs 
IPAQI 3SEGSQA0BPH*VSAKPMLBSSSYtiIRTARSLAINPKDP 
PTWSVLAGHSHTVSDSXKSI»ITSIRDKAPGQttHCDYSnx»XHSC 
IRDIBQASLAAV5Q3LATRDDISVEAL0EQLTSVVQBIGHLIDP 
XATAARGEAA(^ j GHKGTQXiASYFEPX»ILAAV 5VASB^ILDRQQ(^I 
TVXiXJTKTLABSTVLOMLYTUUCBGGGN P KAQHTHDAI TKAAQLMK 
KAVTOIHVTLKBAASEVGLVGGMVnAIAKAMSKLDBGTPPEPBX; 
TPVDYQTTVn/KySH7ilAVT^BHMTKSVTNPE8U3GI*AS®lTSD 
YGHIAFQGQMAAATAEPEKIGFQIBrESVQI>U5HGCI FLVQKAQ\ 
ALQVCPTDSYTKRKLI BOUlAVrBKVSLVl*SAL0AGlJKGTQACI 
TAATAVSGI lADIJ3TTIMFATAGTI/!lAKNSETFW3HMNTLKTA 
KALVEDTKXiVSGAASTPDKLAQAAQSSAATITQLABVVKIiGAA 
SLGSDDPBTQWliXKM KDVAEM*SOLI£A7RBAASKPVODPSM 
YQlJ0G3UVKVM\rniVTSId^TVKAVH)BATHGTRALBATIBCIKQ 
BLTVFQSKDVPBKTSS PBRSIRlITKGrnrMATAKAVAAGNSCRQB 
DVIATAKbSRXAVSDMIjTACKOftSFHPDVSDBVRTRALRPGTBC 
Amino acid segment containing signal peptide 
(A=Alanine, C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, G=Glycine, 
H=Histidine, I=Isoleucine, K=Lysine, 
L=Leucine, M=Methionine, N=Asparagine, 
P=Proline, Q=Glutamine, R=Arginine, 
S=Serine, T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, X=Unknown, *=Stop 
Codon, /=possible nucleotide deletion, 
\=possible nucleotide insertion) 
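
The legend above can be read as a simple lookup table. The following is a minimal sketch, not part of the original listing, of how a segment from this table might be expanded symbol by symbol. The names `LEGEND`, `decode`, and `unrecognized` are hypothetical, and treating lower-case letters as equivalent to upper-case ones is an assumption.

```python
# One-letter codes exactly as given in the table legend.
LEGEND = {
    "A": "Alanine", "C": "Cysteine", "D": "Aspartic Acid", "E": "Glutamic Acid",
    "F": "Phenylalanine", "G": "Glycine", "H": "Histidine", "I": "Isoleucine",
    "K": "Lysine", "L": "Leucine", "M": "Methionine", "N": "Asparagine",
    "P": "Proline", "Q": "Glutamine", "R": "Arginine", "S": "Serine",
    "T": "Threonine", "V": "Valine", "W": "Tryptophan", "Y": "Tyrosine",
    "X": "Unknown", "*": "Stop Codon",
    "/": "possible nucleotide deletion",
    "\\": "possible nucleotide insertion",
}


def decode(segment):
    """Return (symbol, meaning) pairs; symbols outside the legend map to None."""
    # Assumption: whitespace is layout only, and case is not significant.
    return [(ch, LEGEND.get(ch.upper())) for ch in segment if not ch.isspace()]


def unrecognized(segment):
    """Return the sorted set of symbols that the legend does not define."""
    return sorted({ch for ch, meaning in decode(segment) if meaning is None})
```

A check such as `unrecognized(line)` could flag characters that fall outside the legend, for example digits or letters such as B that the legend does not assign.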



tLGYLDLLBHVLVI LQKPTPBLKQQLAAPSKHVAGAVTELIQAA 
EAMKGTEMVDPEDPTVTAETBLLQAAA3IEAAAKKLBQLKPRAK 
PKBADBTLDFBEQlLEAAKSIAAAXSALVKSASAftQRKLVAQGK 
VOS I PANAADDGQWS<^LISAARWVAAATSSWBAAHASVQGHA 
SKKKLXSSAXQVAASTAQLLVACKVKADQDSEAMHRLQAAGNAV 
KHASCNLVSAAQKAAFOXADDDDWVKTKFVGGXAQl IAAQHEN 
IXKZaELBERRKKLAQIBQQQYXFLPTRLBEDEG 



FPGtGSPALQUBVLlCLGLHClLRRMjNVLAPlFYRNIVmjjTBM 
AJWSLAWTVTSYVFLKFIXXKKHXJSTGFVSNIinYIJflRVQQF 
TSRRVELIJT^IiaBLSIJlWHIiGRRTGBVLHIADRGTSSVTGLL 
SYLVFNVTPTLADI 1 1GIIYFSMFFHAWFGLTVFLTMSLYLTLT 
TWTKWRTKFRSAMJTQENATHAHAVDSIiMFBTVKyirMABSTfB 
VRR YRKAI 1 KYQGIJSPfKSSASLVIiIilJQrrQNLVIGLGIdjAGSLIjC 
AyFVTEQKLQVGDYVLPGTYI IQLYMPLNWFGTYYRMXQTHFID 
MEKMFDLLKK\2TKVKDLPGAGP FRFQKGRIBFHNVirFSyADGR 
BTLQDVSPTVMPGQfTLALVGPSGASKSTILRLLFRPYDISSGCI 
RIIX50DISQVTQALFRFSHWHLCPKDTVLPNI)TIADNXRYGRVT 
AflHDBVRAAAQAAGIHDAIMAFPEGYRTQVGKRGLKLSGGBiQQR 
VAIASTILEAP6I ILLDHATSALTTSNERAIQASIAKVCAKBTr 
lVVAHRLSTVVNAD0II»VIKDGCIVERGRHRAU^SRGGvy7VDMW 



KSNVKKAGKSKSNDSREGLEDAVDGADHALQKAIKS 
OSSS PQRVQ&PHSSPPRFVTVEBLLETARGVntHAIaAHEIWNO 
DPQIKPVKLPEHSIJWRVKBrVHKAFVDCLSVULSSDPPAyDfiA 
IKLVGHIKBTLLS FLLPOHTRIiRKQ ITBVLDLDLIKQHAENQAL 
DISSCIiAEFIIGMMGTLCAPARDKBVKKIiKDIICBIVPIiFaErFSV 
LDI/1KVDHAHFAI SSIRPHLMQQSVBYERKKFQBI LERQPBJSLD 
FVTOWl^KASKDUmjKTKHALPVQGMAAGSGDMPRI^PVAVQN 
yAYLKLLKWDHLQRPFPBTVU4D0SRFHBL0LQ\RBQLTILQAV 
LLVTFSIffiAPGISSOADPAEKLKMIVKrLLTDMHLPSPHLKDVL 
TTIGBUCVCLBVSSCLSIjCGSSPFTTDKBTVLKOQIQAVASPDDP 
IHRlMHSItniTFLBTYI»ASOHQKPIiPTVK>QI»SPVQRRLBEVAI 
KFARLVNYHKMVFCPYYDAI LSKILVRS 



AARLCRAQPTKSANMIRDLS KMYPQTRHPAPHQPAQPFKFT I SB 
SCTRIKKBFQPLQAQYHSLKIJBCBKLASERTEMQRHYVMYYKMS 
YGI^IEMHKQAEIVKRLNAICAQVIPFIjSQKBQQQVVQAVKRAK 
QVTMAEUBOIGCXX>LOAQHI^HOHGLPVPLTPHPSGLQPPAIP 
PIGSSAGLLALSSALGGQSHLPIKDBKKHnDNDBQBDRDSIKSS 
SVSPSASPRGABKHRNSADySSKSKKQKTBBKBIAARYDSDGKX 
SDDNLWDVSNKDPSSPRG3 PAHSPRKNGLDKTRLLKKDAPISP 
ASIASSSSTPSSKSKELSLNBKSTTFVSKSHTPTPRTDAPTPGS 
NSTPGLRPVPGKPPGVDPLASSLRTPMAVPCPY PTP FGTVPHAG 
MNGBLTSPGAAYAGLHN1SPQMSAAAAAAAAAAAYGRSPWGPD 
PHHEHEVPAIP HiLTGIPGGKPAYSFHVSADGQl«PVPPPPDAL 
IGPGIPRHAEQINTLNHGKVV1IAVTISNPTRHVYTGGKDCVKVW 
DISBPGHKSPVBQLDCLKREMYIRSCEIiliPDGHTLXVGGEASTL 
SIWtJLAAPTPRI KAKLTSSAPACYAIAI SPDSKVCPStXSDGNI 
AVWDU{NQTLVRQFQGirnXL\SCIDISMKJrKIjWTGGIJ3NTVR^ 
B\DIAEGRQLQQHD/FFTSPVFSLGYCP\TBEJILAVQffiNSH\V 
BVI^VTBwPDKYDLKLHSSCVLSIiKPAHCGKWPXVSTGKrMLLNA 
W\RTPYG\ASIP\QSKBSSS\VLSCDI\SVDDKYXVTGS\GDK\ 
RATVYKVIY 



AARLCRAQPTKSAHM iRDLSKhnrPQTRHPAPHQPAQPPKFTISB 
SCDRI KBBFQPL(MYHSLKLECBKLASB1CTBMQRHYVMYYEMS 
yglwikmhkqaeivkri^iaicaqvipflsqbhqqowqavkhak 

QVTMAEl^AIIQOOQLQftQHLSHGHGIiPVPLTPHPSGLQPPAIP 
p T nftAA fll.I J iT-RymT HT.pt KlfflICKHHIfflDHQRDBDfi IKSS 

SVSPSASFHGAEKHHNSADYSSBSKKQKTKKKEIAARYDSDGBK 
SDDNLVVDVSHKDPSSPaGSPABSPRENGLDKTWJJCKnAPIS P 
AS IASSSSTPSSKSKBLSLNEKSTTPVSKSNTPrPETDAPTPGS 
MSTPGLRPVPGKPPGVDPLASSLRTPMAVPCPYPTPFGIVWIAO 



KNGELTSPGAAYAQLHNISPCISAAAAAAAAAAAYC31SPVVGFD 

PHHIIHRVPAIPPKLTGIPGGKPAYSFHVSADGQMQPVPFPPnAL 

IGPGIPRHARQIinXKHGBWCAVTISHPTRHVYTGGiQSCVKVW 

DISHPG3TJCBPVSQJdXIIJIRDSYrRSCRIiIiPIX3RTLIVGGEASTL 

SIWDIAAPTPRIXAHLTSSAPACYALAISPDSKVCFSCCSDGNI 

AVWDliHBIQYW/RQFQGEHnXSASCIDISinDGTKLHTGGLDNTyBLS 

W\DLRBGRQLQQHD/FFTSPVFSDGYCP\TBKWLAVt3<KNSH\V 

BVLHVTZPDKYOLHLHBSCVLSLKFAHCGKtfF\VSTOKDNLIi?A 

W\RTPYG\ASIP\Q3X3SSS\VIiSCDI\SVDDKYIVTGS\GDK\ 

RATVYHVIY 



esgrrrrlkrrrspcpgtaggpgbtnpgpgacprgprbbaaaKm 

KIAPQKAPPVPGADGDrKKAP ABACS PSPASPPADGRLKAAAKR 
VTPPSDEDIVSGAVKPKDPWRItAQNVrFVDRVTGAYKOACQKIiSC 
RQI PKLLRQIflBFTDIfiHRMX^^LKEBKLDYKTCBALBgvrKR. 
LQFKVVDLgQTOIJJBDGASALFDMIgYYBSATHTiHISFHKHIGT 
P6WQAAAHMHRKTSC10YL\DARNTFIJiimSAPFVARAI£lXRS5 
IAVU!UniASLSGCTUtLIiATMiXMNMNIiRi&YL\An!lXLKGDO 
DSAQIiaiXiLKFliCSl^IliDLIUiHHVIiDSGLAri CBGlJCBQ RKGI. 
VTL\VLOTBQl»XBrEGHAPlGMTI»PflTOSLETWJlxailfPI<aJHGV' 
fiHLKNGLISHRSVLRIiGIiASTECLTCHGAVAVTkBFIABSPRIXRL 
DliRB^IKTGGU<ALSLAL£CVHH?LLRLDIJ7RBPKKRAVK5FIB 
TQKALLAEIOSGCKKMLVLARBRBBKBQPPQLSAfiMPBT PVTBP 
QPPDgPAAgVQWGAPSPAPSPDSDSDSDSD G KBBE g HB GBRDBT 
PSQAIOTRDTCSSBPQPPPSPPRSGPPLPSCLKPKFALAbPFBP 
PPGPBVKGGSOQIiSHBLSCSKHBKBLBBLLLKASQBSGQgn, 



SVTYICIILHKNTGALPADPVQWSOrFrPS-rKQQLl^FLCa^” 

YFYLKIPGFAILtKPLC3CLTKBNI>AI)AlDPKSFSH5SFRSLRXA 

IiKNASTLALPDSSQPF\SUTEABVQGCWBlLTQC3LGPLPV 



LPDVBKIGRRRQlRMOSVBKOAATSVSNPRGRPSRGRPPKbQRN 
SRGGQGPGVBKPPHIiAALI LARGGS KOI PIiKNI KHLACVPUGW 
VLRAALDSGAFQSVWV3TDHDBIBNVAKQFGAQVHRRSSBVSKD 
S STSLDA 1 1 BFliJYHNKVDIVGNI QATS PCLH PTDLQXVAEMIR 
KBGYDSVPSWRRHOFRffSBXQKDVRBVTKPUfLHPAKRPRRQD 
tfCCBLYBKGSFYPAKRHLXBMGYLQGGKMAYYBKRABHSVDIZTV 
DIDWPIABQHVL.RYGYFGKBKLKBIKLLVCSIDGCL’IKGHIYVS 
GDQKB I XSYDVKDAIG1 S DLKXSG IBVRLI SHRACSRQTLSSLK 
LDCKMKVSVSDKLAWDRWRKEMGIjCWKBVAYLQNEVSDKBCLK 
RVGLSGAPADACSTAQKAVGYICKCHGGRQA\IRBFAKRIC\IiL 
KHHOLINFMPKNRNLAVNI OEKK 



WTS WWMSSVLT I bLFSWJGWKMLiiY 5APSAGG YXiLPRXPVGTPA 
GGGFPRRHSVTLPSSKFHOSDLLSSLKGBPAPAJLSSRnSRFRDR 
SFSBGGERLIJ^KQPGGGQVNSSRYKT\BLCRPFEENQACKYC 
DKCQFAH»IHKLRSI*TRHPKYK!rKLCRTFHriGFCPTCPRCHFI 
HNAKERRAIiAOAimCSADRPRLQHSPfiPAGFPSAAATTiAATGLL 
DSPTSITPPPliaw>DLL0SPTLPDGTRNPP\AFSSQBLA3LFA 
PSMCLPG3G3PTTFI»FRPMSBSPHMFDSPPSPQDSI*SDQBGTI»S 
SSS3SHS03D3PTLDSSHRLPI FSRLS ISDD 



SSALQEPGXRMHTFGVPLPHRRKQI ISCNICQLRFNSDSQAAAH 
YKGTKHAKKLKAliKAMKNXQICSVTAKDSAKTTFTSrTTNTIllTS 
SDKTDGTAGTPAlSTTTTVBIRKSSVMTTBTrSKVBJCSPTTATC 
NSSCPSTBTKSBKAKRLL\YCSLCKVAVHaASQLBAHS5C3TKHK 
TMLEARKGSGTIKAFPRAGVHBJCGPVHKQNTaLQHKTFTCBICD 
VHVNSBTQLKQH1SSRRHKI1RAAGKF PKPKYSP Ytf KLQKIAHPL 
GVKLVPSKBPSKPlAPRTLPNPIiAAAAAAAAVAVSSPPSLRTAP 
AATLFQTSALPPALIiRPAPGPIRTAHTPVLPAPY 



I^KYBARLTLAKLDDFBEDKRDDDRNRVNQREKAAKJTELINKL 

nflobarxdlatvhsnpfddpdaablkpfgdpdsbbpitbtasp 

RKmJSFyHKSYNPPKBVQrTPQYL^FDBPEAFVTI KDSPPQST 
KKKHIRPVDMSKYLYADSSKTKBBHIiDESNPFYBPKSTPPPIfHI* 
vnpvqblbtbrrvkrkapappvls pktgvlnbntvsagkdlsts 




















WO 01/53312 



PCT/US00/34263 



cted beginning nucleotide location corresponding to first amino acid residue of amino acid sequence 

nucleotide location corresponding to first amino acid residue of amino acid sequence 





P KPS PI PSPVLStKPHASQSlUiVWCKBVTKHYRGVKITNPTPSW 
RHGI>SFCAIIiHHFRPDLIDYKSIiRPQDIICKNNKKAYlX3FASIGl 
SRId^BPSWnOJJUPDKLTVMTYLYQIRAHFSGQKI^VVQIK^ 
SSKSTYKVGNYETDTOSSVIX^BKPYAKLSPIiKRKPHLQQPISQA. 
VDFLSQpDSVFVNDSGVlSSESEHQTTODHI^PSTASPYCRRTK 
SDTKPQKSQQSSCTTSGSnDPGICSSTnST^QVLLGKKRIJLKA 
btlbi*sdlyvsdxkkwisppficebtdeqkixjtu>igshlkkkk 
LBHSRSLBCESDPSSPIKKTSLSPTSKI iCJYSTSRDLDliAKKKHA 
SLEQTBSDPDADRTTL£JHADHSSKIVQ^RI*IiSRQKKLKBRARVI* 
LBQARRDAALKM2JKHKTH13UVPFCHR0LSDQQDBHRaRQLRBR 
ARQLlAEAR5GGKMSKLPSYGKRAABKIiKERSKASGDENI03TBl 
DTNKE I PEG PVVGGGDEL™LENI)LDTPBQHS JCLVDLKL JCKLLB 
VQPQVANSPSSAAQKAVTESSEQDMKSQTBDLKTERIiQKrTBRF 
RFPWPSKDSTVRKTQIiQSPSQYIBHRPBMKRQRSXQEDTKKGN 
EBKAAITBTOHKPSSDBVUJMQPKDS\SQYWGHIAAIjKKKQKQ 
IDTRAAL VEKRIiR YI>IirrGKNTKKKEA>»K) BWFMbVNK KN7VLTR 
IQQfOLSUjBKBHPIiSRRYBMJiHBUUmLM EDWQKTRAQKRRB 
QLIJ^B^VM,VinO^DALVRDJ J 3 A 0 EXQ?^EHEDEHLSimJBQNKG 
KM&KKBEKCVLQ 



ptalWtapscafpvqprqpsvbglsqitkslyisugvaaknklm 

LSSNQXTMVIIJVSVBWHTLYKDIQYI^JVPV71D3PHSHl»CDPro 

PIADHIHSVKMKQGR\TIiMCAftGVSRSAAlfCIAYI^KYHAMSI* 

U1AHTWTKSCRPIIRPWSGFHBQLIHYEFOI/FGKNTVHMVBSPV 

GWIPDIYBKEVRLMIPL 



Tg Kl)BK TAHWbEMVLIHPWKRX»SBlCVAKlSBPBADVESVLGVS 
NLLQVLQKPKGSIiKSSKKKNGlCVRPADKII^SNKBKKKCVSSBG 
BKXBCWELTTEPSLTHNSSGLL8PLRKKPLEDLVCIKLADIS I NY 
VNERKSEQHLBFLSTLLDSPSSSRVFKMLIiCTBKQSIVQAKPLB 
IAKLVQKNPAVQFLYQKLIGWLNKDQRKDPGFLVDIbYSALRCC 
DNDMEHKKVU)I>LTI0/DI*KWN5UiXl IKKACPSSDKHRLVTP WI> 
KGDHX3EKI>VICLADCIjCNKDLBSRVSSBSHFSERW1‘L*LSLVLSQ 
HVKHDYIiIGDVYVEHIIVRLHETLFKTKKbSEftBSSIXSSVSPIC 
DVAYNYFSSAKECliljMPSSBDIJjIiXLFQIjCAOSKEKTHbPDFLI 
CXLKKTKLSGVHIJjVHQTOSSYIQSSTFIiHLSALWLKNQVQASSI> 
DIHSIXJVLLSAVDDLLNTLLESBOSYLMGVYIGSVMPNDSBWBK 
MRQSLiP>4QtJLHRPIJiBGRLSI4SnrBCFKTDPXBQI>I KTLPSHLCT 
SAU*SKMVLXALRKBTVIjBNHKI<BKIIAELLYSI<QWCEBI»I2NPP 
in^IGPCEIIXJKMtriTYDJniRVlXaJMSGLIiQliLFNRSRSlfCjrLW 
Sill IAKLIIjSRSISSDZVKPHYKBKESPFPLTBGKIiHTXQSIjCP 
FLS KBHKKg FSftQCIPAIiLGWTKKDLC SraGGP G H IJlXFMSCbQ 
TKSIDDGKI*XiHaXLKIIISNKKKHHDIFLPSC2iLSBASpKVLGV 
NIKI IRFI*SbPIJCYCSSPliABSKin3PIHCSMIiAWljBTTSSNQAL 
YSIPLVQLPTlCVSCDLACDLSAPPDSTTIiDTXGNIiPVNLlSBWK 
8 FPSQQXHSliIiLPr LV T V 'l ljKM KDV SSTS FQHBMLKPM CBTIiTY 
ISKEQbLSHKI*PARLVADQiCINIiPBYIX 7 rLLfiTIAPLIaLFRARP 
VQIAVYUJttYKlMPBIiPQYDQDHIjKSYtSDBKKBPAIiSPPAMjMS 
IXSIQKDIiLEJNVXGCIPVGQrvrriKPIjSKDPCYVljGYLLTWKLI 
LTFFKAASSQLR^YSHYLRKTKSIitfSLLYQLFRLMPBHPTYT^B 
TAVEV?NKD PKTPFTBBLQLS IRBTYVLPYHI PHLACSVYHMTL 
KDLPAMVRLWWNSSEKRVPUIV33aPTSKYVSSVI*SPQBI33VQT 
STQLFWGMTVKARATTRBVMATYTIBDIVIHLI IQLPSNYPLGS 
I rVESGKRVGVAVQQWRHIMLQIiSTYLTHQNGSIMBGIiALWKKN 
VDRRPBGV8DCHICPSVXHGFNYSLPKKnCRTCKKKFHSA\CliY 
KWFTSSNKSTCSLCRBTFP 



HAEXi^ASPSDIKI^VSNGDrFIiCSttRQSOGMtKVRQVSLTYP 
GSPAPSHSLPLQPRSGGSJUCPSRAR/ PDPHQLPDDTSSAQSROY 
GAQRRi’GGbSYPAASPTPHAAPLADPVSKMAMAYGSSLAAQGKB 
LVDKHXDRPIPITKLKYYPAVDTMYVGRKIjGIiIiFPPYLHQIWBV 
QYQQDTPVAPRlTJVKAPDXiYIPAMAPITYVLVMLALGTQDRFS 
PXHJ/SXJASSALAifliTIfVlAIIiLSLYliVTVNTDIkTTXDLVAFU 
GYKYVtMIGGVXifflfM^eKIGYYLVUJSCCVAIFVFHIRTUlLK 
IiCLFFFFPCATPVLPLPSIiXSAL/CLSHLSVSSwFCPCQPPLPC 
PLPPLQMKTAKESLSTEOSERG 



KLEWOTmjSKVXISRBVTMIASKFGIGQQVRHSIiLQyi/SVW 

DIDPVYSLSBPSPPEIAVKDKLRAAFWTHVVMBDIJKGLPVHTyL 

AEAQI^HX^DSKP\BQPSMDBlAgrXRgQLOAPRTiHN 



SPIiPQVAAWNYLBHRI»SDSiNPWRMI^KGTItTPLQHPQPPPPgPG 
ftHS pgatpgpgtatakhssgvapaaspaapspgssggggffssl 
SNAVKQ fPXAAAAATPSB(jVGGGSGGftGR(3GAASHVIiLVIDBPHT 
DWAKYFRGKKIHGBIDIKVEQABrailLlCLVAfiAMGGPSVEMBVI* 
EHGVKWHSLXPDFVLIEQHAFSMAHNGDyilSLVIGLQYAGIPS 
VNSLHSVYNPCDKPWVFAQHVRliHKKLOTEBFPIJGDQTFYPliHIC 
StCLSSNTrn^VVVltMQ^TXiWGtfGK V KVEgfOHDPQDIASVVT^LT 
KTTMftKPPIIlAKnyVRVQKIGOHYKMMRTSVSGBWKTHTOSA. 
HLEQIAMSI!RYKLS#VDTCSBIPGGLDICAVEMmKDGaDaiIE 
VVG 55 MPI*IGD»aDKDXQIdVELVVHSM 2 k 0 AliPRQH 0 IU>A 5 PGS 
GSEGQTPSPGALPLCTlQTSQQPAGPPAQQttPPPQGGPPQPGPGP 
QRQGPPLQQRPPPQGQQHLSGLGPPAGSPIiPQKIiPSPTSAPQQP 
ASQAAPPTQGQGRQSRPVAGGPGAPPAARPPA3 P3PQRQAGPPQ 
JtfrRQTSVSGPAPPKASGAPPGOQQRQGPPQKPPGPAGPTRQASQ 
^PVPRTOPPTTQQPRPSGPGPftGRPKPQLftQKPSCDVPPPATA 
AAGGPPHPQLHKSQSLTNAPNIiPKPAPPRPSbSQDBVKABTIRS 
LRKSPASLFSD 



EGLTGPC&P V P PUJGRGPPHGJORAGHRRAVBUAGPBS LPPUPK 
SLIHDS PBAGTHQGPLDABTBVGADRCCSrXAyQBQRPQVEQVGK 
QAPLS PGLPAh^GPGPGPCKDPAQAGGAGftSGS EPLVTVTVQCA 
PTVMiRARRGADIiSSLRAIjIjSQAIiPHQV^OLGQltS^lAPGSDGH 
HVPIPB2ESLQRAKQnAAACpRGLQIjQCRGRGGRPVLY0VVAQH 
SYSAQGPHDLGFRQGDTVDVIjCBVDQWfLBGHCZDGRIGIFPKCP 
VVPAGPRMSGAPGRLPRSQQGDQP 



PVLVTS nRMRKADTXJlPPQLMKVSADI ISTVBPNHTGELLATGD 
KGGHWIPORBPBSKNAPHSQGBYDVYSTFOSHBPEPDYLKSLE 
ZEBXXZnCIRHLPQQNAAHSItLSTlIDXTI^UIKITERDKRPBGyN 
UOJHBGKIiKDLSrTVTSLQVPVIjKPTffilWVEVS PERI PANGHTYH 
INS ISVNSDCKTYHSAODIiRINLHHIAXTnRSFTP\NrVDIKPA 
NMEDLTEVTEA3EFHPIIIICNLFVYSSSKGSLRLCDMRAAALCDK 
BSKLFKBPRDPS3^RSFFSBtIS\SVSDVKPSHSCRYMLTR\DyL 
TVKVWDL\WfflARPIBTYOVHDYLRSKLCSLYHNDCI PDKPBCA. 
ieSGSDSVIHTGA\YNNPPRM P n RH T K RPVTL\BASRBSSKPRAV 
LKPRR VCVGQKRRRDD I SVDS LDPTKKXUTTRWHPAENT IA IAA 
ItWLYI PQDKVNS WH 



GIRTPKFCIXSPTSDUBMRNGRGftGKRMRPNSNTPVNBTATASDS 
KGTSNSSKTRAGANSKGRRGSQNSSBHRPPA33TSEDVKASPSS 
ANKRKtnCPLSDMRUiSSSKDSKBSKRVKTNSMGSATGPLPGTKV 
BPTVLDHNCPS PVLlDCPHPNCtJKKY H flXBGUYBQAHAHTDDD 
SKPEADGDSRYGEKPILHADLGSaJG\ASVSQK\GSLSPARSAT 
PKVRLVEPHSPSPSSKPSTRGLCKKKLSGBGOTDLGALSNDGSD 
DGPSVHDBTSNnAFDSLHaraffiKaKCKKPasiiKPKKIPSKSIiK 
SARPI /APIAXPPQQIYTPQ'C A TP TA ASPGSSSGLTAXVftQAMP 
NSPQLEP IQ PKPTVMGKPFTVNP ALTPAKDKKKKD KKKXBSS KE 
LBSPLTPGKVCRAEEGKSPFRBSSO^GMKMBGIiLNGSSDPHQSR 
LASIKAEADKJYSFTDNAPSPSIGGSSIILENTTPTQPLTPLHVV 

tqngarassvktns paysdisdaghdgegkvdsvks kdaeqlvk 
EGAKKTIiFPPQPQSKDSPYYQGPBSYYSPSYAQSSPGALNPSSO 
AGVRSQALKTKRDKBPBSIEGXVKND1CBEKHPELS33SQQPSV 
IQQRPKMYMQSLYWQYAYVPPYGY5DQSYHTHLLSTOTAYRQQ 
YEEQQKRQSLEQQQRGVDKKAEMGtiKEREAAliKBEHKQKPSIPP 
TLTKAPSliTDLVKSGPGKAKKPGADPAKSVI IPKLDDSSKLPGQ 
APBGLKVBgiSP ASHLSgBA S KMCTGABOGgQABMDP I LWYRQB& 
EPRMWTYVYPAKYSDI KSKDERWKKBRDRTLKKER3RS KDSVPK 



KDGKBST SSDCKLPTSEBSRIiGSKBPRPS VHVPVSS PLTQHQSY 
IPYMHGYSYSOSYDPNHPSYRSMPAVMMQNYPGSyLPSSYSFSP 
YG3 KVSGGKDADKARASPSVTCKSSSBSKALD1LQ0HASHYKS K 
SPnSDKTSQBRDRGGCGWGGGGSCSSVGGASGGRRSVDRPRT 
SPSQRLMSTHHHHHHLSVSLLPAQYNIiPYAAGLSSlAIVASQQG 

STPSLYPPPRR 



AHTOVFDRRVPSXHSGDFOAPFQTSAAMHHPSQBSPTbPESSAT 
DSDYYEPTGGAPHQYCSPTSA3YG\KAIJJPYQYQfYH0VKGSAGS 
TPAKAYADYSYA3SYHQYGGAYNRVPSA33?QPBKBVTKPKVEMV 
BGKPKKVRKPRTX Y5S PQIAALQRRPQKTQYLALPERAELAASL 
GLTQrrQVZIW FCPKRSKI KKIMKNGBMPPEHS PSSSDPMAQJSP 
QSPAVWBPQGSSRSI^HHPHAHPPTSNQSPASSYLEHSASWYTS 
AASSINSHbPPPGSLQHPIALASGTLY 



KGLRRTCCAMBBSD3BKTTBKENliGPRMDPPLGBPG\GSlJGWVI* 

PHIAHKKKVLLMGKSGSGKTSMRSIXFABYIARDTRRlXSaTItD 



D0GGQDTFMEZnn?TS0SDNXFRNVBVLirVFDVBSR5IjSKDMHY 
YQSCLfiAILQNSPDAKI PCLVHDTOLVQRDQWILIPKKRKBDLR 
RLSRPIjECSCPRTSIWDETLYKAWSSXVYQUIPI'IVQQLBMNLRN 
FAEII EADBVLLPHRATPLVISHYQCKBQHDAHRPBKI8NI IRQ 
PKLSC5KLAASFQSHBVHNSNFAAPIDI PTSHTYVMWMSDPS I 
PSAATLlNIRHARKHFBKLBRVDGPKQCLUflR 



KQLEJoXXIAMfiESDSBKTTtSKBffljGPRKDP PLGKPG\GS LGWVX, 
F?fTAMKKKVLI*K5KS GS GKTSMRS 1 1 FANT IAROTRRLGATTLD 
RlRSbQIKSSLSTYSLV0SVl39TKTPDVRHSHVRFX£NLVLNLW 
DCGGQDTFMKNYPTSQRIMJIPIlNVHVLrYVFDVESRBLKKDKHY 
YQSdiSAIbQNSPnAKIFCLVHKMnbVQBOQRDLIPKKRBEDLR 
RLSRPLBCSCPRTSIWDBTLYKAHSS 1 VYQLI PNVQQLEMNLRN 
PAKI IBADEVUJ ? ERATPLVISIfYQCKEQRDAHRFBKISNI IRQ 
PKLSCSKXAASFQSMEVRHSNF7VAFlDrFTSinYVMVVMSDPSI 
PSAATblNIRHARJCHFaKLKRVDGPJCQCliLMR 



KGLKRTCCAMBESDSBKTTRKEmiGPRMDPPIiGKPG\GSLGWVL 
I^TAHX3pCVLlHGiCSGSGKTSMRSllFANyZARI>TRRIiGATXLD 
RIHSLQINSSbSTYSLVBSVGNTKTPOVKHSHVRPLCaiLVimW 
DCGGQOTFMEHYFTS<3!RCHIFRNVEVLXYVFDVBSRKIiBKDMHY 
YQSCLB AIbQK SPDAKIPCLVHXMDLVQgDQHDLI PKBRHBDm 
RLSHPLRC8CFRTS I WDETLYKAW3S IVYQLI PIJVQQIiBMSLRN 
FABI IBADBVLLFBRATFLVISHYQCKKQHDAHRFEiaSHI IRQ 

pklscsklaasfqsmevrnsnfaapidiptshtyvmwmsdpsi 

PSAATLrHIRMARKHFBKliERVtXJPICQCZLLHH 



Kin^MNKSRWQSRRRHGRRSHQQNP W PRLROSBDRSDSRAAQPA 
HESGHCJDDBSPSTSSGTAGTSSVPBLPGPYPDPBKKRYPRMiPO 
HHNCKPIjTKKSIRQKEMBSKRIaHUjQEKDRRKKIARHGPNASSM 
LRKSQlJGFIlIVTHYCHLAHEIiRIjSCHHRKKVQIRSMDPSAIiASD 
RFTlLILAOTNSDRLPTVNDVTVGCSKYQIIHIi^LinrPTIiKVFM 
HHfl/YFTHRKV\HSVCHASIlJHLDSHXliLCI>IGIiARTPGCATLL 
PASLFVHSHPAGITRPG\MLCSFRIPGAHSCAS?3iaiOAHNCPS 
TCLSRRVLI,TNVVTGHRQSFCrrNSlJVbAQQPAllHAPLLFNGC3lfi 
G3IFAIDI*RCGK(X3RG^nCATRLFHDSAVT5VRXLQDBQYIllA3D 
NAGKZKLSlDLRTTKCVRQYBGHVNBYAyi^LHVHBBBGXLVAVG 
QDCYTRI KGIBDARLLRTI PSPTPASKADIPSVAFSSRLGOSEG 
APGLIMAVGQDLYCYSYS 



ppxarlqkpgtsrrhmaapsgvhulvrrgshrxfssphihiylh , 
kosssqqhrk fffrrqrdishs i vlpaavssahpvpkhikkpdy 

VTTGXVPDWGDSIKVraiSDQIQGITOACQIARHVIJJAGKSLKV 
DKTTKBIDALVHRBI I SHNAYPS PLGYGGPPKSVCTSVNNVLCB 
GIPDSRPLQDGDIINIDVTVYYNaYHQDTSBTPLVGHVDEOGKK 
LVBVARRCRI)RAIAAC!RAGAPPSVIGNTISHrTHQWGFQVCPHF 
VGHOIGS YFHSIPBIWHHANDSDLPMEBGWAFTIEPIITBGS PB 

fkvlbdawtwsld/tsrvsaqfehtvlitsrgaqiltklphka 



IGQSAGPPPQTLQCSALHHDP 
CCAHCSRPCRIXXPPAOQCHTHVPPGNAIiKGVQPPKLSRTLALI 
SSRBPPRKKKKSQTBTGKBRBRTSPLTQGGKRFE^iQHGLRGICM 
TLLITGD3 IVSAEAVVfDHVTMAl?RKtiAPKAGDVXKVLDASNKBW 
WWGQIDDBBGH PPAS FVRLWVNHEDBVEBG PS D VONGHLDPSSD 
CLCLGRPLQSRDQWRANVXNHIMSTKEHYlKHLKDICKGYIiKQC 

rkrrib^fsdeqlkvifgnikdiyrpqmgfvrdlskqyhhddphl 

SBIGPCPLBHQDGFWXYSRYCUNHLDtACMELSKLWKDSRYQHFP 
HACRLLQQNID1 A\10G7LLTPVQX1 CKyPLQIAKLLKTTAQDH 
SDYR YVTJ^AIAVMRHVTQQ INERKRRLKNXD KlAO^QASVLCTfB 
GBDILDRSSEiaYTGRMAWITOP\YGKNQGRVFTIiFDHQWVtjCK 
KDLrRRDX L Y T K GR1DHDKYKWDI EDGRDDDPNVSMKHAFKLH 
HKBTKBXHIiFFAKICLHHKIRWLRAKRKKHKMVQHDHKIGPKISB 
e^KRQAAMTVRXVPKQECGVHSASSVPPSYVPPQDPLKRGQYLVP 
\DGIAQ30VFBPTBPECRSQSPFWQHPSRLTPFKK 



SGGRGPVRVRSEQLSPSAEQVSQISQXSLGRRPLSSLPPPPSRA 

laptrapdtaltimbvabvesflnps ckihtfrpshebfrkfnk 

YIxAYMBSKGAHRAGLAKVIPPKBHKPRQCYDDinHLLIPAPIQQ 
HVTGQSGLFTQYNIQKXAWTVKRFRQLANSGKYCTPRyLDYROL 
ERKYWKNLTFVAP I YGADIHGS IYDBGVDBRJIXARIJITVLnVVB 
BBtXJlSIBGVHTPYLYFGMKKTTFAMHTKDHDLYSINYLHFGBP 
KSWYAlPPEHGXRliffiLAQGPFPSSSQGCDAFLRHKKrLIGPSV 
LKKYGI PPDK1TQRAGBPM1TFPYGYHAGPIJHGPHCARSTNPAT 
VRWI DYGKVAXLCTCRKOHVKXSMD IFVRXPQPDRYQtAnQQGKD 
IYIIDHTKPTPASTPBVKAWLQRRRKVRKASR3FQCARSTSKRP 
KADKBKBVSDHVDGABVPOTDSVTDDLKVSKKSKAAVKLRNTKA 
SSBRBSSASRMQVBQNLSDHIKLSGNSCLSTSVTBDIKTKDDKA 
YAYKSVPSISSEADDSIPLSTGYBKPEKSDPSKLSWPKSPBSCS 
SVABSHGVLTBGEESDVBSHGNSLBPGEIPAVPSGERNSPKFPS 

iaegknxtskswrh plsrppars phtlvbqoaps deelpbvls I 

RKKVK B TR SMAKPLlHLWOTKPPNPAAgQgYKATVARMKPHCAI 
CTLLMPYHKHJSSNBSNnARKBTKLDBVVTSEaiCTKPLI PEHCP 
IYSBBHIBYSPPHAFLBKDGTSLIilSCAKOCVRVHASCYGIPSH 
BICXXniLCARCKRliAkllABCCLCMZiRGGALKQTKZiNKHAHVKCA 
VAVPBVRFTtJVPERTQIDVGRI PLQRLKLKCIPCRHRVKRVSGA 
CIQCSYGRCPASFHVrCAHAAGVI>\NBPDDWFYVVNITCFRHXV 
NPNVKSKACKKVI3VU0TV ITKHRNTRYYSCRVMAVTStfl'F YBV 
nFtSXSSPSRDTFPRDIVSSDCLKZ^GPPAEGEWQVXHPDGKLYG 
akypgshiahmxqvbpkdgsqiahkrkdiytijdbelpkrvkarf 

VSAT® OUJTPOOVHS LSS PHVSQAQQETYLGFW INS KX5QCNI F 
L6GTY 



PVARLKBQEGBGGLGPRKSKGRARGRERRRKMQin'SCCFVFLVQ 
GSLYLVIOOQDDGPPGSHJPBRDDHKGQPRPRVPRKRGHISPKS 
RPMAKSTUXJLLAPPGEAi^II/^PJWRPNHSPPPSAKVKKJPG 
KOTFYSKI KTVTtfllliVTtnCrVIJHGNGTPSVHPQHHATGQGNXS 
ISLVPPSKAVBFHQEQQIFIRAKASKIFNC\RMBWBKVB\RGRR 
TSLFTHDPAKICSRI3HA(^SATHSCSQPFXWCVY1APYSTDYR 
LVQKVCPDYHYHSDTPYYPSG 



HPLPSASLGLPSVSLGVSLCVRSALLRAWPMliPKRRRARVGSP 
SGDAASSTPPSTRFPGVAIYLVR PRMGRSRRAFLTGLAHSKGPR 
VLDACSS KATHWMKKTSAKBAVSWQHRRMAAAPPGCTPPALLD 
ISHLTKSUSAGQPVPVECRHRLEVAGPSKGPLSPAKMPAYACQR 
PTPLTHHNTGLSEALBIIiAEAAGPBGSBGBtLLTPCRAASVLKAL 
PSPVTTLSQLQGLPHPGBHHSRVVQKIJiBHaVCBEVERVRRSB/ 
RIJTQXKJVGVKTADRWYRfiGLRTLDDLRBQPQKLTQQQKAGBP 
! SRRAGPHASXiNCTLDPSASTP 



QQDF3SIiADLTDHRAHRCPGDGni>DPQZ*SWVASSPSS KDYASpT 
QMIGDGCDLGLGBKEGGTGIiPYPCQFCDKSPIRLSYLKRHBQIH 
SDXLPFKCTYCSRLPKHXRSRDBHIKLinGDKXYHCXSCRAAPS 
RSPHIJaHUcraSSSKPPKC^CKRGFSSTSSLOSHMQAHiWKK 
EHLAKSEKRAKKDDFHCDYCBirrFSQTBBLKKHVLTRHPQLSEX 
\EGVYCHLDSHRQPDSSNHSVSPDPVLGSVASMSSATPDS3ASV 

HEGSrPDSTLKPLRGQKKMRDDGQGWTKWYSCPyCSKRDFNSIi 

AVLBIHLKTIHADKPQQSHTOQICUXSMPTLfNLlIBHVRKLHKN 

HAYPVt^PGHXSAFHCNYCPEMFADIKSLQKHIRVSHCEPNANP 

SD©TOAFPCKQCa«;FLTESSI.TBHIO\Q\ftHCSVBSAHIiESPV 



a'ajou tk'i— » :• 1 i t * w .« : > r* winu omj ^ 



kkskaeqspvssdvkvsspkrqrlsasrnsishbkypchqcduc 



' I i 4~I|I ;>«!•» n i ) « » J r •) : i 



YKTrSTHYVCKSCDKQPSSVDD\LQKH\liLDttPHPIiCCTHCT\L 

CpBVPDS\KVSI\QVHIAVKHSMBKKWyRCI! ACWgDF HKKADIO 

VHVKHSHI/JNPAK7^HKCIPOGBTPSTBVKLQCHITTH3KICYNCK 

FCSKAFHAI IlJiBKHIiRKKHCVFnAftTBKGTWWjVPPWATKKAfi 

PADWXSOJJCOTKAPllSHKASHDDVnASBPf«GCDIC 3 GAAYIMB ! 

VLLQSHRLRDHNrRPGEDDGSRKKABFIKGSHKCHVCSfiTPPSE 

HGLRKHLaraRGPAKHYMCPICGHUFPSIXTLTBHKVTOSKSLD 

TCTCRICKMP&QSEEBPI BHt3CHHPlHaNSLTGPSCVVCS«QTVr 

q mgT. tf XHGTP HMQjCLAGSfiAASSPKGQQMKLYKCftliCItKBra 

SKQDI*VKLDVHCn*PYGI»CAGQ1ARSAKGQVGGIjAPPKPADRPCA 

GUlCPECSVKFEaAHDIiBSHMQfVDHRDLTPBTSGPRKOTQTSPV 

PRKKTYQClKCQWrFKHBRKIQIHVftiraMIBBGIHHBCKLCNQH 

FDSPAKtiLCHUBHSFBGWX^rFKCPVUETVPVQftHKLQQKIPA 

VHGQKDKiroCSOCPQlCPFPQTELQHHIMSaHAQ 



SYBIVGKNKLEVHHSC^KAWXCSI^RLIiPLGKHLPLLDRGFH 
KBPRSRGSRBRDNMLHIiHHSCLCPRSWI*PAMlAVLI*SIAPSASS 
DZSASRPNILLLNADDLG IQDIGCYflHSTWRTPHIDRIABDGVK 
LTQHISAASLCTPSRAAFIjTQirYPVRSGMVSSIGYRVLQKTGAS 
GQLP TOBTTPAKILgKKGYAPGUGKWHIgljHCKSA SD HCHHPI* 
HHt5FDHFYt3MPPSLrtSDCARWBI£BKRVNIjEQKLHFLFQVIAI»V 
ALTLVRGKLTHLIPVSW><PVIWSALSA\n^JUI ASSYPV GALIVHA 
Ixa^LMRHHTITBQPMCPQRTTPLILQBWASPtKRHKHSPFLLPV’ 

SFLHVHI PLlTHKNPliGKSliTGI>yUDjyVKaQ}WMVGRII*DTLI)V 

EGLSNSTLTYFTSDHGGSLERQDCan'QYGGHNGIYKDGKCMGGN 

BGGIHVPGIPRHPGVtPAGRVIGEPTSLHDVFPTVVRIiRGSKVP 

Q|)RVIDGQDI*LPIiI*LGTAC3BSDnBFIiMHYCEaPLHAARWHQKI)R 

<3TMHKV HPV T PVF p?SGftGACY(atKVCPCgGBE^VHHDPPI*LFD 

l^RDPSBTHII*TPASBPVFTQVHKR\VQQAVWBHQRTIiSPVPLQ 

LDiaXaaWRPWI^PCOGPFPIOfCLHKDDPQ 



MynPtKRKTXKNX^FENHWjWJGMPAV RVKTKLIiBS BQGS PN 
VHNYPDMHAVPXJU^NNVKGRPPKDSX>SV0HFQTOraPVni>SX6nC 
ARTSPTAVSSSPVSmASASSPSSTSTSSSSSSRLASSPTVITS 
VSSASSSSTVLTPGPLVA3ASQVQGQQFLHIIHPVPPSSPMNIX) 
8IWLSHVHRIPVVVQ3VPVVYIAVRSPGNVNHTXVVPLLEDGRG 
BBXAQMDPRGI^PRQSICSIJSDDDDIjPKVTIiDSVHBTGSTALS ia 
RAVQEVHPS PV3RVROWRMKNQKFPCS I S PPSIBSTR RQR'rVU J 
PPDSRKTAYSTDC3)F\EGLQQKLYTKS53PGRVHRHTHTGEKPY 
KCTWEGCTWKPARSDKLTRHYRKHTGVKPFKCWXnDRSPSRSDH 
LALHERRHMLV 

KTKAKl^PENCKDCHI L^AEAPKSKKICKSIjKIGGLVTGI lalt 
ltvlfkgskhfhpkvpkkaydhkhtfyskgkxkkiymkidpvtr 

TBIFRSGNGTDBTLKVHDFXffGYTGXYFVGLQKCPIKTQtKVTP 

KFSKPKBEIDKNBBITTTPPBOSVXWVPAEKpra?RDFLKNSKI 

LKICDNVTMYW\INPTL\ISGTFAKQIjHHNPAFIHVSKLQDPE 

bhxjbdi^fpanbkkgibqjjeqwwpqvkvbktrharqasebblp 

ikdytbkgiefdpmldbrgycciychrqihycrrvcbpllgyyp 

YPYC^QGGRPIOOTMPCNWWVARMUSIV 



“BTMAKNGPBBCBDCHIIjNAKAPKSKKICKSlJUOGLVrGILALT 
LIVLFUGSKHPWPEVPKICAYDMKIffFYSSGKY KKXYM BIPgvrR 
TBI PRSGNGTDKlLBVHDFKKGYTGrYFVGIiQKCP'XKTQIKVTP 
BFSgPggBlPBR KKl TTTPPBQSVIWVPABKPIEHRDFLgWSKI 
I^ICDmmm?\XNPTL\lSGTPAKQLHHyFAFIILV 3 RXX?DPB 
ZD 



BBGKDLHPPANEKKGIEQNBQWVVPOVKVBKTRHARQASKBELP 

IHDYTBNGIBFDPllLDERGYCCrSfCRRCTRYQlRVCliPIiLGYYP 



yPYCTfQGGRVICBVIMPCIWWVAHMLGRV 



6064 



311 



6065 



HiPQSLPRPTOHSPPYSLEKMTDLVAVWDVALSXXJVEKXBFEHG 
TTSGKH WYVDGKKB IRKEWIPKLVGKBTFYVGAAHTKATIIJID 
MSGPAYHYTLBIlKnCSLKSSMBDRSKTTKTWI£NDGENFRIV 
LBKZAMDVffCN3RKLBTAGBPVDDGTETBFSZGTH\ACYXKAV\ 
SSG\KRKBGIIHTLIVDNRBIPBIAS 



1153 



641 



KSVRVARVAHVR 13 l^YRRGASSPPVP^PGAQGVAk!LLRDAT^' 

ABSBAFHAATBRRMPGQCSVLLPPGQGSQVVGMGSGLUiYFRVR 

BLYAAABHVLGYtniELSLHGPQBTIZSRTVHCQE^IFVASLAAV 

BKLHHLQPSVIBJCyAAAGPSVGBPAALVPAGAHBEAEG 



6066 



68 



3470 



6067 



858 



6068 



13 



VKENMPATRKPMRyGHTBGiTrK V CP JDSGSPI VTOGSDGDVRIW 
HDLDDDDPKPINVGBKAYSCALKSGKLVTAVSNNTIQVHTFPEG 
VPDGILTRPTTNAHHVVF®K3X3TBCtAAGSSD\PLWKIVDVMDSS 
QQKT?RGHBAPVLSI>SFDPKDIPIASASCDGSVRVWQISDQTCA. 
ISWPIJjQKCNDVIHAKS ICSLANQPKSGKLLAIPVBKSVKLYRR 
BSWSHQFDLSDNFISCyrUKn/TKSPO^JYIiAAGeiBGXiJIVVJNV 
BTKlK3<BRVKHEKIJXAI<3JIANHPTCGRISYTDABGNLGIiI»0JV 
CDPSGKTSSSECVB5RVBKDYNDUTX3DDMSNAGDFLNDNAVEXP 
SFSKG1 INDDEDDHDIVWASGRPaQJtSHXLBDDENSVDISMLKT 

gssllkkksbogqegsihnlplvtsorpfydgpkptprqkpfqs 

GSTPLHLTHRPWVTOiS IGI IRCYNDEQDMAIDV EFH DT S 1HH AT 
HLStfrLNYTIASIiSHBAlLlACBSTDBLASXLHCLHFSSNDSSK 
EWI IDLPQNBDIBAXCLGQGMAAAATSAUjLRLPTTGGVQKBVP 
SLAGPVVSMACHGEQLFIVYHRGTOPDGDQCLGVQLLBLGKKKR 
QXLHGDPLPLTRKSYUWIGPSAEGTPCYVDSBGIVRHLNRGIjG 
NIWTPI CNTRKHCHGKSDHYWWGI HKNPQQLRCI PCKGSR PPP 
TLPRPAVATLS FKLPYCQ XATEKGQMBE^FWRSVI FHNHLDYLA, 
KNGYBYEBSTKNQArKEXJQEXiLMKMlAl^CKLBREPRCVBIiADL 
HI'QNAVNLAIXYASRSRKLILAQKLSKLAVBKAABLTATQVBSK 
BKKBDPRKXLNACJYSNTATBKSQPR FRNQ VBSDAJEDSGEADDBB 
KPBIHHPGQNS PSK3TOSSDVSAKSOAVTPSSQGHVNPFKVSAS 
SKBPAHSPOTSARSTOHJWMGKSSKKSTALSHTTNNBKSPI IKP 
LIPKPKPKQASAASYPQKRNSOTNKTBHVKBRaLKNVLSETPAI 
CPPCNTENQRPKTG FD HWLgENRSWILSDHPPPSDBADI IRBGH 
I3PRVLSTBBKKVHANKAK0BTASBGTBAKKRKRVVlffiSDBTKN 
QH8KAKBNLNL3KKQByLDFSTNQKI*SAFAFKQB 



L^QRICVLI^RGKMAVTGWLBSIJnAQKXAIJjQDCiRRKVHYt*^ - 
PDGKBMAKEYDBKTSKLLVRBWRVKSALGAKGQWQLSVroPAPL 
GAtaiIiJPKLIKBSHANPIFWUa)TKMSFQWRIiaa*PYPKI>VYSV 
SVQQKERCX I VRTTNKKYYKIIPS IPDIJDRHQLPLDDALLSFA\T 
PTAP 



1730 



GSKMADLANEBKPAIAPPVFVFQKDKGQKSPAKQKSLSDSGBSP” 

RGBAEZiPHHGTGHPKSAGKHAIiKPPAPAGASASTPPPPAPKAQL 

PPyPRELAGRSAGGSSPBGGKDSDREDCTTYCPPVKHERTSSLTQ 

FPPSOSHERSSGFRIJa>PTLIHGOAPSAGLPSCKPKKQQR3VLR 

PAVLQAPQPKALSQTVPSSGTNGVBIiPADCTGAVPAAS P DTAAH 

RSPSEAADKVOvUSERBPQKNRSSNASBBRACEKKDPATQQAPV 

FGQNLRDKVKLlNBSVDSADMBllAGRPSADTPTAlliYFLQYlSS 

slen31TJSaiiassnxfvfgqnhskrvi>sppiqjNEvsshanrena 

aaesgsbsssqbatpkkbslabsaaaytkatarkcllhkvevit 

gbbaesnvlgkqcklpvfdktsqswvergrgllrlndmastddg 

TIOSR LSDAGPRGSLR \LI LNTKLHAQMQIDXASEK\S ZRlTAM 

dnkdqgvkvflisasskdtcqvyaalhhrilauishveqbqbak 

MPAPBPGAAPSNHEDDSDDDDVLAPSGATAAQAGDBGDGQTTG3 

T 



6069 



27 



ptrpgqrgsssamaaqrlgkkvlsklqspsrargpggspgglqk 

RHARVTVKYDRRKLQRHIiDVBKWIDGR LKBLYR(^iBADKPDKIN 
IPELligLKSBBSRSRiaQGLLKSOGKPVEDFlQSLLAKLOGIiHR 








Q\pSlMPSPSP\DGQPSAPPQGPGAirEASPLTLlALPPGPPfiR 

RPALLCVL6CI 


6070 


470 


058 


XRVTVDGKPLHV IPPhQFbDSPEW/ RFTKTHRGRHF\QVTLXAE 
TDOrrVSHRRKKLYIiLFAQHRYI SRLFSVIiIGSDIADKLTAlND 
HVYIGKRYHYDIEIiPHPYQMSTPBIRRSPLTQHPQRSRHTK 


6071 


2 


1654 




6072 


1 



742 


VLPSRRTSKQQVYPFLPHDV1.1 1TKKKS BBSYNVHDYSLRDQIaL 
VBSCDNBEUJSSPGKHSSTOLYSROSSASHLFTLTVLSNHANBK 
VBMLLGABTQSKSARWITA1/5HSSGKPPAE5PT8LTQVBIVSSPT 
AKQPDKLSK3VADVWW \YBRVSDOinrB&ER\URI>GH^^ 
CAHBIItXyvriDKNVKRMGRYiIdiBTMV 


6073 


$20 


860 


P CRRGIARPLS RRPG/ S I X.VHCAVGVSRSATLV LAYU'lLYHHJUT 
LVBA I HKVKDHRGI 1 PNKGKl^QLUMJJRRLRQqiiEA 


6074 

1 

l 


i¥b “ 


1110 


PGRSCMATBIiQCPDSWPCHSQQVN3ASTPSPBQLRPGDI*XLDHA 
GGNRASRAKVILLTGYAHSSIiPABLDSGAOGGSSLFSEGSSGSG 
DSSSYBAPAGSSPLEDCmSl^IGAQLKIiPWnXJIRKIiQTI IR 
DKTASRGDPMPSADRLIHLWBEGMlQIiPVKEaiVTTPTGyKYB 
OVKFBKONCGVS IMRSGEAMKQGIiRDCCRSIRIGKII*IQSDKET 
QRAKVYYAKFPPDIYRRKVIiIJdYPIlXIlt3\lflTVIKAVKVLJKHG 
VQPSVI ILI*SIiFSTPHGAK3 1 IQEP PKITI kTTKVHPVAWHFG 
QKYPGTD 


6075 



320 


1 


1091 


P>ldOP^BVKHH\TGYVPIDGHKTLPSJlCHQCVIVSS6SHLLGT 
jO^GPBIBRABCrriraCYDAPTTGYSADVfflncrTraVVAHSSVFRV' 
LRRPQSFVKitrP if fVP IPWGP PS KMQKPQQ8LVRVI QRAGLVPP 
MMBA^VSPGHMRQFDDLPRGETGiaiRBlCSHSHLSTGWFTHVIA 
VBICTHVHVYGMVPPHYCSQRPRIiGRMPYHXVBPltBPDBCVTYI 

QNEHSRKGNHHRFITEKRVPSSNAQLYGITFSHPSWT 


6076 


1721 



107 


HPS PTEAPRVQHLTTOCTWRJ LFLVAAA’rLnuAQVQLV QSGABV 
KKPGASVKVSCKVSGYTLIBLSMHMVROAPGRtSLBVfMGAPDPBD 
GBTT£AQKPQGRVtMTBOTSTDTAYMEI^SX41SBDIAVYYC3CTD 
HGDYAFDBTCQGTHVTVSaAPTKAPDVFPIISGCRHPKDNSPVV 

iaci»itgvhptsv\tvtwimgtqsqa\qrtfpbiqiirdsyymts 

SQI^XTLQQVRQGBYICCVVQHTASICSKKEIPHJfPESPKAQASSV 

PTAQPQAEGSLAKATTAPArnagTGRGGBBXXRgKgKBBQBERB 

TKTPBCPSHTQPLGVYIJ,TPAVQDIJtt*HDK3iT?TCPVVGSDLKD 

AHLTWBVAGrVPTGGVBBGUKRHSNGSQSQHSRLTLPHSIjWNA 

GTSVTCTLNHPSLPPQRLMALRBPAAOAPVKLSLHIiLASSDPPB 

A\ASWLLGEVSGPSPPKILLMHLBDHQBVHT3GPAPARPI*PKP\ 

RSTTPWA\H3VLRVPAPPSPQPATYTCW5HKDSRTIiLH7VSRSIj 

BVSYVTDHGPMK 


6077 


3687 


1268 


LLPDMNLQPl FW IGLISSVOCVFAQTDKNRCLKANAKS CGKCIQ 

AGPNCGWCTHSTPLQEGMPTSARCDDLEAliKKKGCPPDDIENPR 

GSBDIKKNKHVTHRSBGTABKLKPBDITQIQPQQhVLRLRSGBP 

qtptlkfkraedypidlyyi^\disysmkddlbnvkslgtdi^n 

BMRRIT^FRIGP^PVEICrVHPTIBTTPAKLSHPCTSEQNCTS 

PP3YKNVL3IiIHKEBVniBI»VGKQRISQn»DSPBGGFDAIMQVA 

V003LIC3?RyVTRl,T,VFSTDAGFHPAGDGKLg?IVIiPHDGQCHL 
6078 



1926 



180 



1586 



141 



6080 



1199 



6081 



866 



6082 



283 



1288 



6083 



1865 



309 




EHNMYTMSHYYUYPS IAHLVQKLSQNN I QTI PAVTBBFQPVYKB 
LKNLXP KSAVOTL3AN33NVI QL IXDAYliSLSSKVI LKNGKLSB 
GVTI3YQ5 Y\ CBNG!VNGTGB)3GRKCSNISIC3)EVQFKISITSNK 
CPKKDSDSFKtRPLGFrBSVEVXLQYlCECBCQSBGIPBSPEGCH 
EGNGTFBCGAC3tC2fEGRVGRHCECSTDBVHSBDIGCFEARKENQ 
PQKSASNHGRVPSAGQCVQaHHIMraHXYSGKFCBCDNPBCDRS 
MGI*I CSGNGVCKCRVCECNPNYTGEAQJCSIiDTSTCEAShDQIC 
NORGICBCGVCKCTPPKyQOgrCBMCgrChGVCABHKRCVOCRA 


raOGBKKDYCTQBCSY PNl'nCVBSRDKLPQPVQPDFVSHCAKKl) 
vddcnfyftysvrgnnevhvhwenpecptgpdiipivagwag 

IVLIGLAlIildlfKLUlIIHDRREFAKFBJCBKHNAKWDTGBNPIY 

KSAVTTWNPKYBGK 




BTEDVMKU^KEDLYCP 1CCSLPDDPRVLPCSHNFCKKCLEGILB " 

GSVRNSLWRPVPFKCPTCRKKTPSYWELIPliQVNYSLKGIVEKY 

NRIKISPKMPVCKOT\l^PUfIP\CL\TDW0LDL/OGIC\ATR 

GEHTKHVFCSIBDAYAQBRDAFESIiPQSPBTWKKQilALSRliPTL 

BTS KRKSliQLLTRDSDKVKEFFBKLiQHIIiDQiaCNBXIiSDFBTKK. 

LAVKQAYDPBIMKUfTILQBQRMAPNIABAFICDVSRPIVPIjQQH 

QEFRSJdKVIKBTPIiPPSNLPASPIjMKNPDTSQWSDIKLVDVDK 

LSLPQDTOTFISKIPWSPYKLFLLILLt/JLVXVFGPTMFLEWSL 

PDDIJVTWKGCLSNPSSYLTKTADPIBQSVFYWBQVTIXnTIPHB 

RFKNPTLVVLNNVAKPVCKYKLL 




ATARDLGOURRIDR WMB3TP3 RGIJNRVHLQCRNIjQBFIjGGLS P 
GVLDRLYGH P ATCLAVFRKLPS LAKNWVMRMLYLBQPLPQAAVA 
LKVKKB f^SKAQEBSTGLLSGLRIWHTQLLPGGLOGLI LNP IFRQ 
NLRXAIiLOGGKAKSDDYSQLGPDKHARnvPSliDKYAEKRWKWL 
HFNVGS PSAAVSQDLAQLLSQAGIJIKSrKPGaPPCITSAGPQPL 
LLDTPAOLHYPMLQYIiQTAQSKGMDLVKXLSPLPQLSFSTIiGKD 
ySVEGMSDSIJJIFIXIHLRBFGLVFQRKRKSRRYYPT/RALAINL 
SSGVSGAC3GTVHQPGFIV\VBTNYRLYAYTBSKLQIALIALFSE 
MLYPFP\Nmnr\ARVTR\BSVQQAIASGrtAQQI IHFLRTRAHP 
VKLBQTPVLP PTITDQim»WEIjBRDRIiRFTEGVLY3!?QFLSQVDP 
ELL\LAHAPKLGVLVFB /NTPAKRIJIVVTPAGHS'DVKR FWKRjQK 
HSS 




IBTI DHVGEPAMAAOAAGViRQRAATQGU^QHALKYLGCDFK 
TLHQ<K^14)SGVLFKDPBFPACPSAXiCjY2GDZiGPG8PQTQGIIHKR 
PTBZiCPSPQFIVGGATRTDICQGGIiGPCHLlAAIASL'XUiBBIiL 
YRWPRDQDPQBHYAG1PHFQP1iCPPS?\F»QYGBNVBVVXDDR 
LPTKNGQLLFLHSBQGNBPNSALX^XAYAKLBGCYBALAGGSTV 
EGPBDFTGGXSBPYDLKXPPAOTjYOI IRKALCAGS LLGCS I DVY 
SAAE3\EAXTSQKLVKSHAYSVXGVBBVNFQGaPBXLIRLKNPKC 
EVEKSGANSHDAPEWNHXDPRRKBBLDKKVEDGRPWMSLSDFVR. 
QFSRLBI CNLSPI1SLSSBHVHKWNLVLFNCHWTBGSTAGGCQNY 
PGSS 




BMLPU^I^IJJtA/GAI^riARFHl^«PBSVTVQEGIiCIPVHC 
SVFYLBYGWKD3TPAYGHWFRBGVSVDQBTPVATOHSTQICVQKB 
TCXn?FHliLGDP£RNNCSLkSIRnARRRI8K5SyPFWVARGRTKFSY 
KYSPLSVYVXALTBRPDIUCPBPLKSGHPSNLTCSVPfiVCEQGT 
PPIFSWMSAAPTSIiGPRTLHSSVIjTIIPRPQDOGTHLIOQVTPP 
GAGVTTKIOTQLSVSWKSGTVRHVVVIxAVC^VAVKIIXIXXCLI 
I LSPHKKKAVRAVBVKKNVYAVM3 




SARSPGPTOTRTAPGX*AAPGLAQPAAIiRIiLLSRPPSAAMDGDGD 
PBSVGQPEEASFKEQPBEASAEKERPKDQQEKEAAAAA\Y\LDB 
IiPKPLLA/ LHVLAALPRHB \l»VQACR\l*VCLRi#KBL VDGAP LWX, 
UCDQQBGLVPEGGVEEERUHWQQPYPLSKRRRNLIiRNPCGRBDL 
KGVCDVBHGGDGWRVBBLPGDSGVBPTHlffiSVKKYFASSFBWCR 
KAQVIDI<)AEGYWBBIdJDT3VPAIVVKDWYSGSSnAGCLYBLTV 
KLLSKHBNVLABFSSGQVAVPQDSDGGGWMBISHTFTDYGPGVR 
FVRPBHGGQDSVYKKGWFGABfVTNSSWVKP 
Amino acid segment containing signal peptide 
(A=Alanine, C=Cysteine, D=Aspartic Acid, E= 
Glutamic Acid, F=Phenylalanine, G=Glycine, 
H=Histidine, I=Isoleucine, K=Lysine, 
L=Leucine, M=Methionine, N=Asparagine, 
P=Proline, Q=Glutamine, R=Arginine, 
S=Serine, T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, X=Unknown, *=Stop 
Codon, /=possible nucleotide deletion, 
\=possible nucleotide insertion) 
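
The legend above can be read as a small symbol table. The following is a minimal sketch, assuming a segment is handled as a plain string; the names AMINO_ACIDS, MARKERS and summarize_segment are hypothetical and are not part of the listing, and the sample string is made up for illustration.

```python
# Symbol table copied from the legend; all identifiers here are hypothetical.
AMINO_ACIDS = {
    "A": "Alanine", "C": "Cysteine", "D": "Aspartic Acid", "E": "Glutamic Acid",
    "F": "Phenylalanine", "G": "Glycine", "H": "Histidine", "I": "Isoleucine",
    "K": "Lysine", "L": "Leucine", "M": "Methionine", "N": "Asparagine",
    "P": "Proline", "Q": "Glutamine", "R": "Arginine", "S": "Serine",
    "T": "Threonine", "V": "Valine", "W": "Tryptophan", "Y": "Tyrosine",
}
MARKERS = {
    "X": "unknown residue",
    "*": "stop codon",
    "/": "possible nucleotide deletion",
    "\\": "possible nucleotide insertion",
}


def summarize_segment(segment):
    """Count residues and legend markers in one segment string.

    Whitespace is skipped. Any other character that the legend does not
    define (including lowercase letters, which are not normalized here)
    is collected under "unrecognized" rather than guessed at.
    """
    summary = {"residues": 0, "markers": {}, "unrecognized": []}
    for ch in segment:
        if ch.isspace():
            continue
        if ch in AMINO_ACIDS:
            summary["residues"] += 1
        elif ch in MARKERS:
            name = MARKERS[ch]
            summary["markers"][name] = summary["markers"].get(name, 0) + 1
        else:
            summary["unrecognized"].append(ch)
    return summary


# Made-up sample string, not taken from the table.
example = summarize_segment("MKT/LLV\\AX*")
```

Keeping the deletion and insertion markers in a separate table from the residues makes it easy to flag segments whose reading frame is uncertain without changing the residue count.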



DVQKBT0IJS*^DSVCTIAKLHDSKMPABI>9fKI£BylSKQftKA 
SEVMGPVEAAPBYRVTVDiANMliTVBIEirBIJrilHKFXRDKYSKR 
reSLESLVPNMJDYIRTVKELGNSLIJKCKHHQJtiQQrLTNATlM 
VVSVTA3TTDGQQL3BKELERLKKACDMALgIJtA3KHRIYFyVE 
SRMSPIAPm*SlIIGASTAAKIh^VAGGLTHLSEMPAariMIdiG 
AQRKTLSGFSSTSVLPHTGnflYHSDIVOSLPPIPPPPSVAPVDL 
ERKAARLVAAKCTLAARVDSPHKSTBGKVGYKIaKDBXBRKFDSW 
QEPPPVKQVKprJ>APLBGQRKKRGGRHYRKHKKaLGLTBlR\KQ 
AEndCFGRIBEDAYQSDLGPSLCTLGKSqSGHVRQT QVK BATKA 
RISJCITXJRTLQKQSW YIK3KSTXHDRSSOTA3SVAPTPLQGLBI 

vnpqaaskkvaearqkyfssmakplkvkgbksguist 



HQurATORfr f^Mfrrj vnPT.rjvr>T.RRan.gT 7 KRr^«y nvgwgvp 

DVQBBlUUn^a?SVKTIAKLWDSKHPABIKHKlKEVTSKQWCA 
SKVMGPVHAAPKYHVTVI1AKNLTVKIKNELNI IHKFIitDJCYSKH 
PPSLgSLVPHAlJ?YIRTVKKI/»NSI/DKCKHHEHLOOIliTNATlrt 
VVSVTASTTQGQQLSBEgl^RLBBACDMALKUPVSKHRXYBYVB 
SUMS PIAPNLSIITGASTAAKIHOVAQGI»TNIiSKMPAC3fIMIJjG 
AQaXTl£GPSSTOVLPHTQYrraSDrVQ3LPPIPPPFSVAP\DIi 
RRKAARLVAAKCrLAARVDSFHBSTBGKVGYgLKD KIERKyDK W 
QBPPPVKQVKPLPAPIiDGQRlCKRGGRHYRKMKBRI/5bTBIR\KQ 
ANKKSPGBlBBDArQEDbGFSLGHLGKSGSG&VBQfrQVKBATKA 
RIS!CTbORTIiQXQSVVYGOKSTIRDRSSGTASSVAPTPDQGI«BI 
VHPQAAKKKVAEANQKYPSSMABFLKVKGBKSGLHST 



SOPaSPQGamAVGRISLGGKRKPBVTI*LPGVS3KRVRRWRRAKV 

GVMIVXPGNPWKPBPATQVPR/VPAQVYLPGHGPPLRK3KBLVM 

nBEAYVLYKRAQTQAPCLSrorVRDHLGCMKTBLPtiTLYIiCAGrr 

Oabsaqsnrwklrmhnlhgtkpppsegsdhe^kbedbbdbbbr 

KPQLKbAMVPHTGGXNRVRVSWLGKBPVAGWSKKGQVBVPALR 
RLK)W2BPQALAAPXiU)BQAQMKPI PSPAGMMGEGPALTWSPR 
VTGRLLTGDOQKNIHIiWTPTDGGSWHVDQRPPVGHTRSVBDLQH 
SPTENTVFASCSAlJASIRITOIRAAPSXACMLTTArAHDGDVNV 
ISnSRRBPFLLSGGDXXaAUCIHDLRQPKSGSPVATPKQHVAFVT 
SVBtlRPQDSGVPAASGADHQlTQWDLG/ XVSRDPBAGDVBADPG 
LADLPC2QIiliFVHQGBTEZiKELHHHPOCPGLLV3TAI*SGPTI PUT 
ISV 



GAATQHQGAKHLLPQTPHGNGbLYAG PHQ 
YNTDPblCBKBKQBFI>EGGVGHVEMLFRCNYLALVGGGKKPHYPP 
NKVMIVfEDLKEQCTVXBIBFSTKVKAVXLRR\DKIVVVUX!MIKV 
FTBTHNPVHQUryra\TCTNPKGIiCVIjCPN 5 NN 3 LLAFPGTHTG 
HVQLVPLASTBKPFVDXPAHBGVLSCIAI^II^GTR IATASBKGT 
LlRlFDTSSOQilQBLRRGSQAAHIYCINFH^ASLXCVSSDHG 
TVHIPAABDPKRNKQSSIiASASFLPKYFSS KStSFSKFQVPSGS P 
ClCAFGTKPtZAVlAX CADGSYYKPLFNPRGBCIRDVYAQPLEMT 
DDKL 



QNSQRTGljPrTIPSRSPPLLTGSDLCSNMPCTCTWENWRQWXRP 
LVAVIYLVSIVVAVPLCVWBtdQKLEtVG IHTKAMPlfiQI FLLLTI 
PI6LWVILQIILVHYTQPELQKPI IRILVfMVPI YSLDSBIAIiKYP 
GIAIYViyTCRECYBAYVIYNPMGPLTNYLTNRYPHLvLIIjBAKD 
<X)KHFPPLCCCPPtlAK5BVLLPRCKIi3VLQYTVVRPPTTTVAIjI 
CKLLGIYDKGNFSFSHAWTYLVI INNMSQLPAMYCLLLFYKVLK 
EKLSPIQPVGKFIjCVKIjVVFVSFWQAVVIALLVKVGVTSBKHTW 
EffQTVKAVATGLQDFIICIBMFLAAIA\HHYTPSYKPYVQKABE 
GSCPDSFIAMWEVSDIEDDXSBQVRHVGR'rVRGHPRKKIiPPEDQ 
DQHBHTS liSSSSQnMS IASSMPPSPMGHYQGPCanVTPQfTTP 
TTAKISDBTbSDTIGBKKBPSDKSVDS 



GASGLVKLLOQGHRCLIAPVAPKLVPPVRGVKKGKHAAFRPQKE 

LKRQ3LLRCPPPPVRRSBKPHW0YHAKIQAPC3ffiLQKIfPSUlLIi 

KTAPVHSCYIK5EEAHBQQl£rEKEAVW(IJCSBQSL3£QGTSP 

SQTCLT'QPtEDKYPOMPTEGIKNLVDFbTGKBVVCHVAltNLAVB 

QLTLSBBPPVPPAVLQQTPFAVlGALLQSSGPBaTALPXRDFLI 

TQPfTGKBLPKMWKIINPKGIibVBEbKKRNVaAPESRLTfiQSG\A 
Amino acid segment containing signal peptide 
(A=Alanine, C=Cysteine, D=Aspartic Acid, E= 
Glutamic Acid, F=Phenylalanine, G=Glycine, 
H=Histidine, I=Isoleucine, K=Lysine, 
L=Leucine, M=Methionine, N=Asparagine, 
P=Proline, Q=Glutamine, R=Arginine, 
S=Serine, T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, X=Unknown, *=Stop 
Codon, /=possible nucleotide deletion, 
\=possible nucleotide insertion) 




TRUSlPGSTXSSRPRIiCALAAEGHFDGHSirrGSRAGAHTGAPAW 
PSRRliRDLPAGQ'WRLRRAAVACKVCXJSLVKHSSGI KGSLPLQK 
LHLVSRS I YHSHHPTXiXLQRPQLRTSFQQPSSLTOLPLRKLKFS 
PXKYGYQPRgNFWPARTATRT.T.KTJUTLILGSAVGGGYTAKBgro 
QHXDMI POLSBYKHXVPDIVWE IDBYIDFBKlRKAbPSSEDLVK 
LAPDPDKIVKSLSLLKDPFYSGSPEBXAPaATDRGSESDKHFRK 
VSDKEKIDQLQBELbBTQIiKYQRICiERLBKEHKEZiRKLVLOKDD 
EOlPPZBSLRXSLIISfrSBVldTVbSDYDAS'TRTQCHLPRVWVG 
DQSAGKTSVliBMlAQARIFPRGSGBMMTRSPVICVTLSEGPHHVA 
LPKDSSSKPDIiTKKKDLAALRHKIKLRMRKNVKEGCTVSPBTIS 
LHVKGPGLQHMVLVDIiPGVINTVTSGMAPiyTKBTIPS ISKAYMQ 
DPNAJ ILCIQDGEVnAKRS IVTDIiVSOCDPidlRTIPVM'KVDL 
AEKNV7VSPSiaQQIIBGKLFPMKALGYFAVVTGKaKSSBSrKAI 
RBYBBBFFQ^SKXAKrSKZiRAHOVTTRNLSLAVSDCPHKNVRBS 
VEQOADQFKATRFrajBTEireHKYPRLRKIiDRirBLFKKAICNBILD 
EVTSIiSQVTPKHKKRIlOOSLWKRVSTHVlKlSlYLPAAQTHNSG 
TFHTTVDIKL3COmi>KQLPNKAVBVAHKTLOKKPSRFPrrKPKGK 
BHDDIPDKLKKAVKBBSIKRHKWNDPAEDSLRVI^IHALKDRSI 
SDKQQWnAAI YPHKRALOARIlCDTR!2AXENMVQPD\WKKRHLyW 
KNRTQBQCVHNBTKNKLHIWLKClJBKHPAYLASDBITTVRjraLB 
SRGVBVDP5I>IXW1TO(JVTRRHFT*KIALi*HOfLCRROTYYYQKH 
FVDSBLBCHDVVLPWRIQaHLAITAHTtiBQQLTHTBVRRLBKNV 
KBVLHDF7iKDGEKKIKIJ»TGKRVQLAEDLKKVREIQKKLDAFXB 
ALHQBK 



PVFVPAPGAVIiEOAS /ASPPLATQTWPU^HCKI PRLPVQAS XL 
FKLQLFFCQLIAIiFVHYIMIYKTVWWYPPSHPPSHTSLNFHIjID 
PNLLMVTTrVLGRRFIGSrVKEASQRGKVSLPRS ILLFLTRPTV 
LTATGWSLCRSLXHLFRTY5 PUJLL/FPLLSVWDVHSVPAAB1A 
P\RKTS LPNHMASMGPRBAVS GLAXS RDYLLTLR VKR3SSTQDS 
CMARTPCP/PHACXILSPSLIRSBVBPLKMDFNHRWKBVLVSSML 
SAYYVA?VPVHPVJCrrHYYDKRWSCSbFIXVSISTSVIL>ftlHI»I- 

PASYCDIIjHKAAAHLGCWQIVDPALCSNVIjQHPWTKBCMWPQGV 
LVKKSKNVYXAVGHVNVAI PSDVSH PR PHPPPSKPLRIUJllilJj 
LEGA.VI VYQLYS LMSSEKWHQTI SUULI LFSNYYAFPKLLRDKL 
VLGKAYS YSAS PQHD1DHRPS 



SOTTHBMBBKBILRRQXRLLQGLIDDYKTUIQWAPAPGTPAASG 
WQPPTYHSGRAFSARYFRPSRRGYSSHHGPSWRKKYS lvnrppg 
PSDPPADHAVRPLH( 5ARGGQPPVPQQHVLBRQV0I>SQGCWVVI It 
VKPPSKSGSASASGAQRGSLHBFBDTPffSDQRPRBGEGEPPRGQ 
LQPSRPTRARGTCS Vg DPLLVOQKBPGKPRMVKSVGSVroSPRB 
PRRTVB RSVIAVKAS FPSSAliPPRJGVALGRKLgSHSVASCAPQ 
LLCaiRRVDAGHrDQFVPSGSVGGPARPASGPRaASRASLVVTCS 
I WK PR KHHY KHVAASSKSPHVARRALSPRVAAEHVCgASAGMAg 
YVBXPQLIADPBPKPRKPATSSXPGSAPSBOfKHKASSPaASSSS 
8PRHQSBAG3KDHASQLSPVLSR3PSG©\RPAVGHSGIiKPLSGB 
TPLSAYKVKSKTKIIRRRtSSTSIiPGDKKSGTSPAATAKSHLSLR 
RRQALRGKS3PVIiKKTPNKGLVt3fVTraRIjCRLPPSRAHI»PTKKA 
SSLHAVRYAPTSKVIKTRYRIVKKTPASPLSAPPTOLSLPSWRA 
RRLS LSRSLVLNRLRPVAStSGGZAjQPGSPmfRSRZTYRCIGGVLY 
KVSAtnCLSKTSGQPSDAGSRPLLRTGRLDPAGSCSRSLASRAVQ 
RSLAI IRQARQHRBKRKHYGKYYHRFGRC3IRGERCP YIHDPEKV 
AVCTRFVROTCKKEDGTCPFSHHVSKBKMPVCSYPLBaSI CSNSN 
CPYSHVYVSRKAEVCSDFLKGYCPLGAKCKKKRTLIiCPDFARBG 
ACPRGAQCQLLttRTQKRHSRRAATS PAPGPSDATARSRVSASHG 
PRKPSASQRPTRQTPS3AALTAAAVAAPPHCPGGSASPSSSKAS 
SSSSSSSSPPASU3HBAPSLQRAALAAACSNSLCKLPSFISLQS 

spspgaqphvraprapltkdsgkplhikprl 



AKAPPTGBSSBPBAHVmTKRLYRAWHAVTrRTiDT »TLQnCEAYQ 
BVFIPBNISLRHKMKLCVKIMPLHPVOTGRXAHEIjIJIRKVYYB 
Amino acid segment containing signal peptide 
(A=Alanine, C=Cysteine, D=Aspartic Acid, E= 
Glutamic Acid, F=Phenylalanine, G=Glycine, 
H=Histidine, I=Isoleucine, K=Lysine, 
L=Leucine, M=Methionine, N=Asparagine, 
P=Proline, Q=Glutamine, R=Arginine, 
S=Serine, T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, X=Unknown, *=Stop 
Codon, /=possible nucleotide deletion, 
\=possible nucleotide insertion) 




1 




VlQlJnCTNKKHXUSRSTIJXIAYRTHLVAGIGFYQHIJJjYlQSHY 

QI£LQCCXDWTHVTDPI*IGCKXPV5ASGKEKDWA01ACHRn*VY 

ICTI^RYQNKLAGVDTBLXAERFYYQALSVAPQIQtP PNQIGTX* 

AGSKYYNVKAMYCYLRCIQSEVSFEGAYGNLKRLYDKAAKMYHO 
LKKCETRKLS PGKKROCDIKSIl>VIlFMYLQSLIiQPKSS SVDSKL 

TSLCQSVLBDFNLCbFYbPSSPNbSliASBDEKBYKSGYAFLPDL 
LITO<VllCa^tCVHSZ*BRAGSKQYSAAlAyTIiALFSHI»VNHVNI 
RUQABUBEGENPVPAFQSDGTDBPESKBFVBBBBBPDPBPPPVT 
PQVGBC5 HKSRKFSRIiSCMUIRRHPPXVGIWSSDI*SBGFBSDSSHD 
SARAS BGSDSGSDKSLEGGGTAFDABTDSEMNSQESRSDIiKD ME 
KEKGTRSFTbEPPRGRfiBAPDSI^PI^PSKASlASNDQAASTQ 
MPQfTKRCPRIAPTFSNIJLl<)PTXNPH , lSASHRPCVNGDVDXPSB 

PASBBGSESBGSBSSGRSCRNHRSIQBKLQVIiMAKGLI^PAVKVF 
UmOTNPDXJC IVCAQSSQSUJNRIiSVLmiJjPAAGKIiQKSGIA 
IjCPKVQDXiIiBGCEIiPDLPSSIiliLPBDMTaJtNLPPLRAAHRRFNP 
OTDRP1J^TIiKBSVVR1(XIRSPGHFIAHWSI1^FNPKVGIP 
VS IAQS BQBSIJiQQAJQAQFRMAQBKARfWRlMRDHAQIfilXZI^KV 
SQLEGSIXJQPK^QSAMSPYLVPDTQAI/JHHIiPVXSQIAXSQlFI 
VI I prtvidgldllkkkhpgahdgxrylbabfkkgnhyiroqkb 
VGKSFKRHKLKRQDADAWTLYKIUDSC 3CQLT\LAQSAGSBDPSG 
NVT I IIt5bPU3NPSI«LSGPMQAAI4)AAAHASVDI KNVtiDFYKQW 
KBIG 


6093 

I 

i 


76 


1002 


ACX3RHAMLALRVART/ SRWGAL \RGAVW APGTRPS KRRACwAUj 
PPVPCCW3CLABRWRLRPAAJU5IiRI»PGI< 3QBNHCSGAGKAAPR\ 
pAAGAGAAABAPGGQWGPASTPSLYBNPWTIPNMLSm'RIGbAP 
VU3YDI IBSDPHIAU3VFAI*AGIiTDI»LDGFIARNHANQRSAl£S 
ALDPLADKIlilSIbYVSIiTYADLIPVPLTYMIISRDVMLIAAVF 
YVRYimiPTPimiAiCYFNPCYAIARI^PTFISKVNTAVQLILVA. 

ASlJVAPVFNYADSIYLQILWCFTAFriAASAySYYHYGRKTVQV 

IKD 




23 


1010 


PF£RCLKGIX2KAKMSKIIKVIjNxYYPPDFDPSKIPKLKi*PJUJRQX 

VVRIWAPFNMRCKTOGBYIYKGKKFNARKSTVQJIEVYIiGLPIFR 

FYIKCTRCIiABITFKn)PENTDYTMHHGA31lNF0ARKl*DEBEEK 

EVQKERKDBFLNNPfOCVbKHRlTOSKIiBMKVItBNUJBLKDIilQH 

OAHVDFEAMDRQHRbSBBBBRRQQQBKDBQBTAALLBBARKRRb 

LSDSDSEDKAAPSPlvQPALRPNPTAXIiDKAPKPXRKVKVWEQSV 

gslgsrppi^rdvvvxxaradpdcsngqpoa/aphprspafqbg 

GQPYTPDAKRVbPEPTGCIPGQ 






1599 


trgraakrsrghgbg flggg fa\ swdyfpsedfyrcgycknbs 
gsrsngmwahsmtvqdyqdlidrgwrrsgkyvykpvmnqtccpq 

YTIRCRPLQFQPSKSHKKVIjKKMLKFIAKGEVPKGSCBXDBPMD 
STMnQAVAGDFALINKLDIQCDWCTLSDDI KBSLESEGKNS KKB 
bpqbllqsqdfvgbklgsgbpshs 



beginning 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 



nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 




Amino acid segment containing signal peptide 
(A=Alanine, C=Cysteine, D=Aspartic Acid, E= 
Glutamic Acid, F=Phenylalanine, G=Glycine, 
H=Histidine, I=Isoleucine, K=Lysine, 
L=Leucine, M=Methionine, N=Asparagine, 
P=Proline, Q=Glutamine, R=Arginine, 
S=Serine, T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, X=Unknown, *=Stop 
Codon, /=possible nucleotide deletion, 
\=possible nucleotide insertion) 



VKVHTVPKPGKGADLSKPPOlKAKBlIUCSRKHLBa^lOQNPAGKL 
BGFQAOGHPPSLFPPKAHSNQPKSLEIlLIFKSLPENASHKLBVR 
WBSSPPSSQTOVTLLBSYQVfYKBYQMVI H KHPPDT P TBSQPTB 
FLCSSPLBABTPPNGPDOCYGSPHQQYWLDOKI IAVGVIDILPH 
CVSSVYLYYDPDYSFLSLGVYSAIiRBIAPTRQLHKICTSQLSYYT 
MGFYIHSCPgMSYRGQYRPSDLIfCP B T YV W V PIBQCLPSLEHSR 
YCRFNQDPKAVDEDRSTBPDRLQVPHKRAIMPYGVYKKQQKDPS 
BBAAVLQYASLVGQKCSERMLLPIOT 



QRVRAALLSSAMEnSBALGFBHMGLDPRLLQAVTDLGHSRPTLI 
QBKAIPIAI£GKDZjIJVBART6SGRTAAYAIPMLQ1iXiUSRSQVIGP 
VVRC^VRGLVLVPTKKLARQAQSMI QQLATY CARDVRVANVSAA 
BDSVSQHAVLM^PDVVVGTPSRJLSHLQQDSliKIiRDSLBLLVV 
DKADLIjFSPGFKSEnJCSljLCHIipRIYiQAFLMSATFNBDVQALKB 
LILHHPVTLKLQESQLPGPDQLQQFQWCBTBBDECFIiLLYALLK 
LSL1 RGKS1 iLFVNTLBRSYRIjRLFLKQF 3I PTCVLNGSLPLRSR 
CHI ISQPNQGFYIXrVIAllBVEVLGAPVKSKRRGRGPKGDKASDP 
KAGVARGIDPHHVSAVIiNFDLPPTPKAYIHRAGRlARANHPGXV 
LTFVLPTBQFHLGKIKBIiLSGKNRGPrLLPYQPRMBBIEGFRYR 
CRX>AMRSVTIC0AIRBARLKBIXBBZiIJI5BKLK7YFBDNP&\mX) 
LUlHDLPUlPAVVKPHLGBVPDyiiVP PALRGLVRPHKK\GRSCL 
PLVGRPRBQSPRTHCAASSTKKRKSDPQPSPPBVVGPLWS 



apgthsggkickssfqitsvttdybgpgspgasdpptpqpptgpp 

PRLPKGBPSPDPGGKGTPENGSPPPGAPSSRFRWKLPHGLGEP 

YRRGRWTCVDVYKRDLSPHSFGGLLEGIRGASGGAGGRSLDSRL 

klaslglqaptppsglsqgptswlrppptspgpoarsptsglgq 

LWPSKAKAJSKPPLSASSPQQRPPBPETGBSAGTSRAATPLPSL 
RVBABAOGSGARTPPLSRRKAVDMRliRHELGAPBBMGQVPPLDS 
RPSSPALYPTHDASLVHKSPDPPGAVAAQKPSLAHSHLAISGHL 
DSDOTSGSGSLVG I13NRIKQAMDLVKSHLMPAVREKVKVLKBQI 
RgLAERHAAIiBQKHGItLRALA\SPEQLGSAGPPRGVPR\DGPPA 
PKSPPVLSLPSLTI VPLGLPGLASAAWPPLPMPALIVPVFPGVG 
VQALSNGPWSPGPLPHLLI IPSUX5GGEGFRTGRQQGAPPGBBT 
QPPPSLPGTPQO 



NYCLKHRSP 



t i* « z 1 V n »>»:» KV; 1 j-gj ;J 



RLNSASQRSSSLPPSHRKSSTPTKRKIMLTPVTVAYSPKRSPKB 

I^PGPSHI^K13BSSPIRFDIU^IJrn^VSrnX2RTm>R^L 

\QFIiPLDDSBEK\TYSKKAT\DNIVNHSSCPKFVPNGVKKVSVR 

tawkkhksvsybqckpvsvtpqgndfbyiakirtlabterfp\d 

ELT KHKDQI BAALSRMPSPGGRITLQTRUSQBAPGRSPGKD 



NYCLRHRSPLBKDSSPGSSSTSLLIKKQHBTSDTPIURAIiKBljt) 

RLNSASQHSSSLPPSHRKSSTPTKRBIMLTPVTVAYSPKRSPKK 

HLSPGFSHLLSKNBSSPIHFDILLDDIiDTVPVSTLCRTHPRKQL 

\QFriPLDDSBKK\XYSHXAr\DBIVNHSSCPRPVPMGVKKVSVR 

TAH3KNXSV$YEQCXPV5VTPQGNDFBYTAitIRTLAETERFF\D 

HLTKKKDQIEAALSRMPSPGGRITLQTRIjKQEAPGRSPGKD 



FVBVSGYI^EADPBPRGRIJIMTYAYLFKY I IIGDTGVGKSCLIiL 
QFTDRRFQPVHDLT1GVEF6ARHVNI0GKQIKL0IWDTAGQBSF 
RS ITRSYYRGAAGALLVYDITRRETPNRLTSBLBDARQHSSSNM 
VIHLIG37K5SLBSRRDVKSBBGBAFARB\HGLIFMBTSAKTACH 
VEEAFINTAKEIYRKIQQGLPDVHNHANGIKIGPQQSISTSVGP 
SASQRBSRDIGSHSGCC 



FRGRAHPLRBVSHWLGCRRVCSWSASWGRLPALSARLSPIiLAFR 

GKMVFPLSC3WQ0YAWGKHGSN3BVARIiLAS3DPLAQIABDKPY 

ABWmtHTIPRGnAKIIJJNRISQKTI^QWIAKIfQDSI^KVKDTF 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to first 
amino acid 
amino acid 
Amino acid segment containing signal peptide 
(A=Alanine, C=Cysteine, D=Aspartic Acid, E= 
Glutamic Acid, F=Phenylalanine, G=Glycine, 
H=Histidine, I=Isoleucine, K=Lysine, 
L=Leucine, M=Methionine, N=Asparagine, 
P=Proline, Q=Glutamine, R=Arginine, 
S=Serine, T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, X=Unknown, *=Stop 
Codon, /=possible nucleotide deletion, 
\=possible nucleotide insertion) 








HGNL^FI^^liSVH^liSiQAHP^KElJlEKl^U^APQHYPbANH 
KPBMAIALTPPQGI2ZGFRFVKKXVTFLKKVPKFQFLIGDRAATH 
LXQTMSHI1S<^VASS1 jQiSCPSHLMRSBKKVVVEQLN1iLVKE isq 
QAAAGMNHEDI PGKLLLQLHQQYPCH)IGCPAI YFLHLLTLRPGB 
AMPl^AHVPHAYIiKGDCVBCMACSnHTVRAgLTPKyiDVPTLCB 
MI»3YTPSS3KDHLPLPTRSQKDPYliSIYDPPVPDFnMKA\BVP 
G\SVTEYKDLAH)SAS ILLMVQGTVIASTPTTQfTPI PLQRGGVL 
FIGANESVSLKLTBPKDIiUIFRACCLJj 


€102 


70 


2415 


0TPQATIAAHGAKDSH£5GS«LPAG3IGASPAAPCCSBSGDERKN 
LBBKSDIHVTVLIGSKQVBBGTCNGDLP3YV3APIBKBVGHDLK 
SLKKliDKLIBQRTVSKKQLBBQVLTZSSBIPKRXRSALKHABBS 
EQFlUQFLBQBTHLPSAXNSHZiLZAQPHMDDIiGTMISQIEEIBH 
KLAYIiKHISQrBKLSDHIQQYLIflHNVPBAASTLVSMAKIjDI Kh 
QESSCTHLIWSFMRATVKFWHKILKDKLTSOPBBILAQLHWPFIA 
PPQSOTVGI»SRPASAPKIYSYMTLFOQIJ»KL(yrSHBljbTBPK\ 
HSQKNTtiFLPPU^S/WPIGVMLTPLQRRFRYHFRGNR&nJVLS 
KPEWYUU3VLfWIGNHTBFIJ3KKlQPILDKVGSLVNARI*EF6RG 
LMMLVLRKLATDIPCLLYDDNLFCHLVDBVXjLFERBIJHSVHCJYP 
GTFRSCMRI LSBBTCTQKWLTVBIUCFALQKMDSMbSSBAAWVSQ 
YKDZTDVDBMXVPDCABTFMTLLLVITDRYXHLPTASRKLQFliE 
LQKDLVDDFRIRLTQVMKKBTRASLGFRYCAILHAVirjriSTVIA 
DHAIB?VFFLGIiQQAALEVFAENZfTl»SKLQLGQLASMBSSVFDDM 
IHIjliKRIiKHDMLTRQVDHV FRBVKDAAKLYKKERWLSLPSQSKQ 
AVMSI^SSACPLLLTLRDHLLQLBQQLCFSLEKI FWQMLVEKLD 
WIYQBIIIANHFNECGAAQLQFDMTRHLFPLPSHYCKnPENYF 
KH 1 KKACXVLNI2AA3SALTAGKDVIiPVQljQGSFP AT 


6103 


207 


2523 

'1 

i 


BSNSTHTTYLBFIQQNBBRDGVRFSHNVWPSSRIjRATRMVVPVA 
ALFTPIiKERPDXiPPIQYKPVLCSRTTCRAVLNPbCQVDYRAKLW 
ACM FCYQRNQFPPSYAGIS KLNQPAEbLFQ FSS I BY WUtGPOM 
PL) FLYVVDTCP^BDZjGAUCSSNQ^LS^PPTALVG^ITFGR 
MVQVHEU3CBGISKSYVFRGTia)I*aAKQIiQBKLGLSICVPVTCJAT 
tK3POVQQPPPSHRFLQPVQKIIXO(I.TIHJiGBLQRDPHPVP06KR 
P1RS SGVALS I AVGLLKCTF POTGAE IMMFI GG P ATQG PGMWO 
DELKTPlRSVfHDIDKENTVICYVKKGTKHFKAIANRAATTOHVTDI 
YACAU)QT01jLBHKCCPHl,TGGYMVWgPSIOT3IiFKQTFQRVFT 
KUMHGQPKMOPGOTIiBIKTPRXBIKISGAXGPCVSLNSKGPCVS 
BNBIGTGGTO0W3QCGLSPTTTLAIYFHWNGHNAPIPQGG\RG 
A\IQFVTUY\QHSSG0RRIJtVTTIAiaAWAnA3QTOiaNlAASFD 
QgAAAXIiM7\RLATYRAin , BBGPPVXRWLDR0LIRIjCQKF0HYHK 
DDPSSPRFSBTFSLYPQFMFHLRHSSPLQVFNHSPDESSYYRHH 
FMRQDLTQSLXMIQPILYAYSFSGPPBPVLLDSSSXLADRIIiLM 
DTFPQILrYBGBTlAQHRKSGYC^MPBYENFRHIiLQAPVDDAQK 
rLH£RFPMPRYIDTBH£X5SQARPl^KVNPSQTHHNMYAWGQES 
GAPILTDDVSLQVTMIHIiKKLAVSSAA 


6104 


124 


732 


KVSBYI ILSKDXIIJ^UAIjAMLVLVVSPtJSAARGVlJltiYPfKRLlR 

KIJ>QSRPGFPSPP«GPALAVQ\AQPCLOSQQMIPVEVKRl/RSL 
LDSIPWMAAPKNRRTIBVHRCRRIDJPQKLIKVKHNIDVCPEOGH 
UQKHVU^CYKKVCKETAKIRfiQIOKQEGQPFKAPTXBTVVL 
ITCBTPSBQDQGKRI IKRDHKRPSHFTQH 


610S 


3 


9B9 


PLBGACTSLVLQRFCHIUlPaPCAPARPEDMRRPAAVPIilXiLCP 
GSaHAKAAXACGRPRMLHHPr/GGQDTQEGRHPWQVS IQRNGSHP 
CGGSL1AEQWVLTAAHCFRNTS BTSLYQVLLGARQLVQPGPHAM 
YARVRQ\rESHPX»YQGTASSADVMiVBLBAPVPPTlfYTLPVCLPD 
PSVI FBTGMHCWVTGKGSPSEBDLLPBI^ILQKLAVPm>T\ PR 
CiHiYSKDTBFGYQPKTIKNDMIiCAGFBEGKKDACKiGDSAGPLV 
CbVGQSHLOAG^SWOEGCARQHRPGVYIHVTAHHlIHIHRIIPK 
beginning 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 



nucleotide 

location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 





GRPPTAPtfrGRPPTANEG0PRIJ>TtKRGCARbLTSXESKGRPAA3 
| AGLRRDRGUJUtf^LRRAPlJVRATRRRAGSPKRCAPRPRACIW 
! WSRARHQ PGGLqiTJ J J JXQFMgDaSAQRCIJCWLRQAKHGKOQV 
1 LYXTHIi SKKB CCST GR IiSTS W TEHDVlBaiTLPKKNIRiGGAPWC 
I PCKETCBHVDOG PGKKCk MWKKHgPRCVCaPPCSN I 'fW KgP V C 
aUXJKTWWBCALTJMOCBQPBLBVOYQGRCKKTCRDVPCPGS 
STCY\VIXnT<HAYCV TODtICP gPASSaQYIiO( 9 nX?VTTS\SfiC 
HLRKATCLLG&S IGLATBGKCIKAKSCKDIQCTGGKKCLWDPKV 
GRGRCSLCDELCPD3KSDKPVCASI»CVTYASBCAMKEAACSSGV 
LLKVKHSGSCUSrSKDTEEHEEDKDQDYSFPISSILEW 



SRCSS PRFBPGRGRGK/ LSPSBHSKNVBVFKAGDBDHH9TLSRB 
DFCTAWML FU f 2CPSKI BVDSVM33mPOT8GimBQFLHXVRK 
KKBAQRYHKBVRHI FTAFDTYYRGFLTLBDFDCAFRGVAPKLPB 
RTVLEVFRKV\DRDS\DGHVSF 



GGSLRFSPPRVPSGSRVFCPVPPGGOGLPSPnSASRPQSPTTW 
CI^RRYMKHKHDDGPBKQKI> 6 ^VDVTPVKnrCVFVVMCX^SWLVIX 
YYFYDLLVYWXGI FCLASA!TGLYSCIAPCVHRIiP\SASW3BSA 
miPTXPMN3LPYFHKRPQiAHMLUAI»FCV AV S WW G V F HH BDQ 
HAHVLQDAIX3IAPCLYHIJCTIRI»PTFKACTI*LLLVIiFLYDIFFV 
FXTPFIiTKBGSSIHVBVTVTCPSDSATRBKIiPMVUCVPRUlSSPL 
AIjCDRPFSIaLC3FGDILVPGLI«VAYCHRFDXQVQSS!tVYFVACrX 
aygvollvtfvaijvlmqrgq paij^ylvpctlvtscavalhrrkl 

GVFWTGSGFAKVLPPSFVfAPAPADGPCPPKDaATPLSPQPPSBE 

patsphpabqspksrtssemgagapmrbpgspahsbgrdqaqps 

PVXQPGASA 



CRSRAGAASGGAI LEGTXLRRQRVIJTNXPUDPLVFSALRAAMLY 
LBDYLKMIE 0 LPMDIJU)RFTEMRBMD 1 iQVC?IAMDQLEQRVSKFF 

mnakknkpbwrbbqhasikkdyykaledadhkvqlanqiydlvd 

RHLRKliDQBLAKPKMBliBADHAGITSIIiERRSItBIiDTPSQPVNN 

HHAHSHTFVBKRKYNPTSHHTTTDHIPBKKFKSBMtljSTLTSDA 

SKKNTLGCRHNNSTASSNHAYNVNSSQPLGSYHIGSLSSGTGAG 

Gl\TMAAAOMWTAQMKBGRRTSSIJaSVBAPKN!IDFQtGKEP 

SMARBTVGYSSSSAI^iTTLTQHASSSAADSRSGRKKKNNimSSS 

QQSSSSSSSSSI^SOSSSST^BISQQTTVVPESDSHSQVDWr 

YDRIBPRYCXCllQVSYGBMVGCDrrODCPIKHFHYGCVGLTBAPK 

gkwycpqctVaamkrrgsrhk 



ACPSAJYTHSOODHSMDBITAVVKX EKEVGC2JNGGWGNG3GAFS Q 
ARSSSTGSSSSTGG G G O gSQPSPLAIJAATCSRlgSPMKMSHNS 
QGPSQSGGTGKLDI.TATQLSQQANt3KQI ISSSSGATPT3KEQSG 
SSTlfGSKGS BSSKNRTVSGGQYVVAAAPHIjQNQQVLTGLPGVMP 
HI QYQVI PQFQ3VDGQQLQPAATGAQVQQDGSGQIQII PQANQQ 
I ITKRCSGGNI I^AJffmiQOAVPIXlOlJWNVI^TO 
VMJICINXTIjIiFVHSVSAATL.TPSSQAVTI 3SSGSQBSG3QPVTS 
GTTISSASLVSSQASSSSFFTHANSYbil TTTSNMGIMEEPTTSG 
SSGTKSQGQTPQRVSGLQGSnALNIOQSOTSGGSLQAGQQKBGB 
Q\HQOTQAAPJCSI^RPQbVQGG\QALQ\AFOAAPLSGQTFTTOA 
ISQSTLQUIQW^VPNSGPII IlHTTVaPNGCjySWQTI^ljQNLQ 
VQHPQAQTITLAPMQOVSIiGQTS8SWTTLTPIASAASl PAGTVT 
VNAAQLSSMPGDQTlNIiSAIiGTSOIQVHP IQGLPLA1ANAPGDH 
GAQLGLHOAGGDG 1HDDTAGGEEGBN 5 PDAQPQAGHKTRREACT 
CPYCKDSBEBtGSGDPGKKKQHlCHICXaOGICVYGKTSHIjRAHLRW 
HTGERPFMCTHSYCGBffiFTRSDKLQRHKRTHIUKKKFACPECPK 
RFHSSDHLSKHI RTHQHKKGGPGVALSVGTLPLDSGAGSGGSCT 
ATPSALITTNMVAMSAICPEGIARIANSGINVKEGGQFCSPnJT 
SANGF 
61X5 1 

1 


! 324 

I 595 


71 


611*> 




1430 






119 1217 



Amino acid segment containing signal peptide 
(A=Alanine, C=Cysteine, D=Aspartic Acid, E= 
Glutamic Acid, F=Phenylalanine, G=Glycine, 
H=Histidine, I=Isoleucine, K=Lysine, 
L=Leucine, M=Methionine, N=Asparagine, 
P=Proline, Q=Glutamine, R=Arginine, 
S=Serine, T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, X=Unknown, *=Stop 
Codon, /=possible nucleotide deletion, 
\=possible nucleotide insertion) 



RVDPRVRGAMAPWGXRLA0VRGVLLDISGVLYDSGAGGGTA1AG 

svbavarlkrsrijcvrfctnksqksraklvgqlqrlgfdiseqb 

V7APAPAAOQU£BRGIxEPyLLlHDGV\ASBFZX)XDTS/STPt3C 

VVIAEfilffiSFSTQHKtMAPQVI/IELBKFVLISLGKCTVYKBTCG 

IJfUJVGPYMKALEYAOGIKAEVGGKPSPBPrKSAIiQAiaVBAHQ 

AVMlGDDIVtH>VG(3iQR0G9ffiALQVRTGKFRPSI)2aHPBVKADG 

yvchiaeavdlllqbadk 



I ’ ) :'ci • IsH >h WcVj y tf ’FfUTtMn'j : >Xrj 



BTMVDHIA1ITB1H5QRIAAVESCFGASG0PLALPGRVLLGEGVL 

tkecrxkakpkipplfndilvygs ivlnkrkyrsqhiiplbkvt 

LSLLPSTLQAKHRNHXKIAXKSPWSAASATERQBWXSHIEECV 
RRQLRATGRPA\ STEHftAPWIPDKATDIOtRCTQTRFSALTRHH 
HCRKCRVVVCABCSRQHFIiLPRIi3PKPVRVCSIiCYRHLAA0QRK 
BKABBQGAGVPRAASHLARPICGRPVEKrmPTRTRRAAGTATB 
PAAHSSTPRGWPGIiPSTADPRPAEHLSPSQLHCPGPQECSSRSC 
PQIJlDPIPWKQfVQTWGVALSGLPVTPCWTIiC , PYGFTA(2^AFPP3 
KPQHTHESW 



PTSRPRPSPGSPAMSWSACVSAAPSSSWPASSSWPOGPRRCCTR 

RRRCSPftOGLAAGSHCSCSPSHRClPVPACHPSPPPXPABOVQC 

GHLPPHADRRALHbPVAAPARGPGPGHPAOTAOPRPARTPPASP 

HOPGRPTVPAPPCPLIAATBPTPSRPHQRKIllBIIK-TLGRGSQVT 

GRPQWFLRGLVLF3L 



DVCQRV CAHPHbYTHIHMHI CAHAC \ IHTHAQ 
SHLYTCWVMMASHTPSHTHPinAVHXEHRADVLRGTTiTPLR 



TGVMPPGRKUAA/ ISSSGPVFEGARA\LQTYKKKRRDB3YTFVQ 

aarpqtlkrpgoklprqlprqlryhbssgplbtlsrlrslcrwi# 

UtPDVLSRAQILELLVLBQFLS ILPGELRVWVQLHN PBSGBE\L 

hpchrscrgtlmghpggtralp\eprcaldgyrs\lrsaqiwsl 

asplrsssalgmclkppybikardflagqsotpaaqmpalfpre 

GCPQXJVTPTRSLTAQLQBlWTyKDVKVTFSCDEWGWLiJSAQWI 

LYKDVMLBNYKHMASbGK 



PGGGWTPGILKBGQGGRRTPIiHiLATRTS 
GLLGLPP PAAMHPAAPPLJ>VVVAAVLWQAAPTRGL IRATSnHHA 
SMDPTUJIiPALPGATLSQEGLQOPLVKAHPDHACSP IAP?PPAFV 
HGSVFXALIJdUTXTfFDLKVLHAQHAGYGAAVVHNVBSHELLIM 
VKNSHBI<KX)TWIPSVFT(SRSSBYLRALPVYKKG8lRVLLVPm 
TPPDGYTLlPrTGrVGUjVIAKGAVMXARCIQHHKIU/JJlNRLTK 
\EQI»KQl\PTHOTQKGDQYDVCAICLDBYBDGDKIiRVLPC7VHAY 
HSRCVDPHXjTQTRKTCPI ckqpvhrgpgdkdqbbbtogqbbctr 
GKPR nHPASBRTPLLGSSPTLPTS PGSLAPAPLVPPQPSTDPPL 
fiPPSSPVXLV 



STISQIACTSQAXPGAQSHRSARGHAAGGKBTAAIiGKBKGKVKK 
KBKBKBTQXKKIGSWJRKKKVKRXEVEQK3 KQBKQBKOKRRKGK 

kkbbebtkqgkbtkkekeqpkgqsekeekkdstltrtplbplek 

MKQIbVLGLDGJUSKTSVLHSLASHRVQBSVAPTQGPHAVCXSTB 

DSCMKFLBIGGSKPFRSYWEMYLSH/ADSLARSPSVGPEQDSQP 

nwKAlQCYIiHQLIAANPVLPLVVPAKKX2DLRAAYFrrrDIHRAIA 

II 



DPRFVTKHTTKAPAQKKTTUPRSSREGTLRSTMKYLSALNPSDL 

IiSVSNISSBPGRRVWTSAPPPQRPFRVCDHKRTIRKGLTAATR 

QELLAKA1iBTLLXJBGVLTLVI££DGXAVDSBDF7QLIjSDDTCLM 

VLQSGQSWSPTRSGVflLSYGUSRBRPKHSKDlAftPTFDVYKQNPR 

DLFGSLNVKATFYGLYSMSCDFQGI*\GPKKVUIBI*U?W7STLLQ 

GLGHHLLGISSTLRHAVEGAEQWQQKGRU4SY 
Amino acid segment containing signal peptide 
(A=Alanine, C=Cysteine, D=Aspartic Acid, E= 
Glutamic Acid, F=Phenylalanine, G=Glycine, 
H=Histidine, I=Isoleucine, K=Lysine, 
L=Leucine, M=Methionine, N=Asparagine, 
P=Proline, Q=Glutamine, R=Arginine, 
S=Serine, T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, X=Unknown, *=Stop 
Codon, /=possible nucleotide deletion, 
\=possible nucleotide insertion) 



LBRAGGGGLSSRALVGSGACXiS LVARAlfGKGLPfiGRKEPVKAVR 
VRYVAFRYinrpRhVCLRIJ9SCSRBVTMSGRGKXJGGKVRAXAKSR 
SSRAGLQPPVGKVmtLUKGNrABRVGACAPVYlAAVLgYhTAB 
I LELAiCTlAARDNKKTRI IPRHLQLAIEHD^IKKLLGKVTIAQG 

g\vlpwiqavllpkktbsqkdb®vhdp 



FVRAQARGSRQPVRHPLIXanGSRLRCRSCGRMKPLKVBECFATAN 
EGNGLRAVT PIiRPGSZ»I*FRSl?PLAYTVCJCGSRGVVCDPCGLGICB 
mffiCSQCRVAKYCSAKCQKKAWmnraCKCLKSCKPRYPPDS 
VRUiGRVVFKLMDGAPSBSBHLYSPYDLBSNINKLTBDKKEGLR 
QbVMTFQHFMRHKIQDASQLPPAPDIiFKAPAKVXCNSFTICNAB 
MQBVGVGLYPS ISLLNBSCDFHCSXVFNGPHLLLRAVRJDIKVGB 
KLTICYIiDKLHTSBRRRXQUiDQYCFBCD\CFRCQTQDKDADML 
TGDKQVHKEVQB3I*KKIEKt*KAHWKHBQVLAMCQAI ISSWSERL 
PDIHIYQLKVljDCAMDACINId»LEBAIiFlGTRlMKPYRXFFPG 
SHPVRGVQVMKVGXliOLHQC^PQAMKNLjaAFDIMRVTHGRRH 
SlilRDtJIiLLB/AMRRQHQSXLSSRSQRBIRRVSLLHALLRSHT 
LCFVBCVNLSYWKPCSVPV 



I W ir ) >j 



RMKNPSBT5KPSMRSGDG 
BTCTPQfnKSLDFQKQPVPVQGAlSTAiQAOAFLGHLHDrVQhAGTSZt 
OAAAQflXWVQSKSHKBSGDSQQPSQPSQQPSVQAAIPOTQIiMLA 

ggqitgltltpaqqqujwoaqaqaqllaaavqqhsasgqhsaa 

GATISASAATPMTQIPI>SQPIQI AQDLQQLQQhQQCJNbKIjQOPV 
LVHPTTTOiQPA\0FIISQTPQGQQOLLQA\QHLLTQLPRQSQAN 
LiLQSQPRI\TLTSQPATPTCTIAATPXQTLPQSQSTPKRIDTPS 
LHRP \SDLKHLBQFARTFRQRRIKLGFT\QGnAGLAMVKLYGND 
PSPTTIFRFBALHLSFKNMCKliKPI»IjBKHI4IIiAENLiSSDS3Ij3S 
PSALNSPGIBGhSRRRKXRTSIEA\MIRVALBKSFLErAOKPTS 
KEITMIADQIiHMBKGVXRVWFCNRRQKEKRIHPPSSGGXTSSSP 
IKAI FPS PTSLVATTPSLVTSSAATTIiTV3PVhPZjTSAAVTNI>S 
V7GTSDTTSKrrtATVXSTAPPASSAVTSPSLSP3P3ASASTS&A 
SS^BTSTTQTISTPI^SSPIiGrpSQVMVTASGIiQTA/AQIJLPFMS 
AAQIJAHASIiAAMAAAAGLHPSIJ4APSQraAGGALLSIWPGT2jS 
GALS P AIJ^SNSTLATIOALASGGSLP ITS IJQATGNL VPANAGGA 
PNTVTAPIjPI2rp©TL8LLTSNP VSLVSAAAASAQH3APVAS LHA 
TfiTSABSIQHSLPTV7VSASQAASTTTTASKAQ 



HbLHRHFGTDMQHXJJFTTGBFQhTBACPYUmtSKBSRFGI USU 
HIXJPLEHKIWGrs/VFTPADYGICVTSI^IjrRiNffnjTVXDKXGVKGFG 
ARBIiLKVGC^LPGAGGSLRFXVPRSTLMDOUtQLRDSXQIXkS IT 

knfkvehigplpitvsslkingynoqgygfbvt^ochqfsldpbt 

SRDIS rVFTPDFTSSNVXRDLSLVTAADIaSPRFTUiJVTIjPHHIiL 
PLCAIP/VPGPGHBBSPfiRLTVFFVSljSIdjGVZLIAFQQAGTlUf 
BFMKTRQR&iASSSSQQiiNGPPfITVISFHSYKSNCKNFLDTTGPS 
DKGRGKNCLPVNTPQSRIQKAAKRSPATYGIISQKKHKCSVYYSK 
HKTSTAAASSTSTTTKBKQTSPIjGSSLPAAKKDICTDAMRHNHI 
S LRYASG INVMLQKNLTLPKHIdiJKEEHTLKWnVFSHPSSBCS 
HKBGIQTOiFPKifX’i)! Kl’SRfJTARFK RhRLCPLKTSKKLPEHHL 
PRHSPQYHQPDLPBISRKN&GKliQQVPVKHEVDHCEHLjOCVPTK 
PS5BKKXHKTSREDC4FSEKQDI PPVRQEDPTRKKKLOBKREGNL 
QNUWSKSRTCRKNXKRGVAPVSRPPEQSDLKIjiVCSDFERSKLS 
SDIHVR3HCIQKSTRBVCKADAKIASSLPAAQRKAEGYYQKPKK 
KCVDKFCSPSSSDCGSSSGSVRASR^WGSMSSTSSSDGDKKPM 
VDAQHFLPAGDSVSQNDFPSSAPISI2iLSH?rrCNPNTGNSLPQY 
ABPSCPSLPAGPTGVHKDKI^ySPGDl^PTPPVCVreSlJJCTLB 
MGVPCVIQRSAPVHHS FrDWSATCEGQPSSAYCPLBU3DYNAPP 
rhmmnyangfpcpadvoh>pidbnsqsthhtpp\nmpab \wsha 
QFPSSSRPYLKSTPKA(M«SGLFGPI\HAP\QSDVYBHCCPIH 
Amino acid segment containing signal peptide 
(A=Alanine, C=Cysteine, D=Aspartic Acid, E= 
Glutamic Acid, F=Phenylalanine, G=Glycine, 
H=Histidine, I=Isoleucine, K=Lysine, 
L=Leucine, M=Methionine, N=Asparagine, 
P=Proline, Q=Glutamine, R=Arginine, 
S=Serine, T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, X=Unknown, *=Stop 
Codon, /=possible nucleotide deletion, 
\=possible nucleotide insertion) 



PTTEHSD /THl^QA\WCKBYYPGP\NPFRAXMtolWTrr\A“ 
NHNANPPLSRDSSTCGNV 

SDBAIiRlAGgRQCRVQLFEISLSHGRVVYSPGEPJLAGTVRVRXi 
QAPLPPRAXRVTCIGSOSVSMKAHPZAHWgBGYraS ^SLA PK 
GSI»PAGBHSPPFQFLLPATAPTSFEGPFGJCIVH(JVRAArHTPRP 
SnfflKCSLVPTILSPMLNSrPOIEQPNVASATKKFSYKLVRrG 
SVVM!ASTDIJlGTfVVGOALQl)HADVBRQSGiaJTSPVVASIiI<lKV 
SYKAKRW tHDVRTXAEVBGAGVKASIRllAt^lHEOIIiVPAIJQSAI* 
PGCSLIHIDYYLQVS LKA PUATVTLPVFIGNXAV/RPCPSBP PA 
RPGAASMGPTPGG\PSAPPQHEABAKAAMCFHPLDPVFT4>TKS 
HSQRQPLLATLSSVPGAPBPCPQDGSPASHPlflPPLCISTGATV 
PYEABGSGGPVPTTSTLIIJPPEYSSWGYPYEAPPSYBQSCGGVB 
PSLTPBS 

KTCPKLTCAFlVSVPDSCCRVCRGIlGEl*SHBHSDGDIPROPAN^r 
EARHSYHRSHYDPPPSRQAGGLSHPPGARSHRGAIiOSQQASGT 
IVQIVmHXHKIgjQfVCVSNGKTYSHSBSWHPWLRAPOIVE CVLC 
TCSVTKaBCKKIHCPNRYPCKYPQKXDGKCCKVCPG/KKAKBBI» 
pgqspdnkcypoghbtmpvybsvpmbdgbttrkxaleterppqv 
BVHVWTIRKGILQHPHIBKISKRHFBKLPHFKl*VTRTrLSQ»flCC 
PTEGBAQI S^CSSRVCSTBLSDLVKVLYLBRSBBGHC 



6124 



1S73 



236 



6125“ 



6126 



“1353" 



389 



RIiLSBAPCpKSRRRFQMNPKKGQAFVHVAyRGGl/ZAVAVP^OIP 
DSVSVQVUTKH^ARAPVAGLPAFIAMV PHSLVNMA7TLLGLSWL 
HRGGAMGLG PRYLKD VFAAMAIiLYGPyQWLPLWTQWRRAAVLDO 
WIiTU* IPAMPVAHdYIDRGWRP \HI*FLSLBCVSIASyGIALliH 
pqgfbvalgahwpavgoalrt \HHHYG/SATPSATYIAIGVLS 
CLGPVVLKLCDHQLARWRL PQCLTGHFW5 KVQ)VI0FHFAFI*FL» 
THFHTHPRFHPSGGKTR 



— 1335 



”463 f VLPRRCLVFVVNTMDSSRBPIXGRIiDAAGFifQVWQRKCADKKCfY 

IKBKELDAFFLHMUCOjGTDinVMKAmHKVKQQPKrrQnASKD 
GRXRtffiKLAt347LSBDB5FLLl*FHRENPLDSSVRFMQIWRKyilA 
DSSGPISAAR^SNFLRDSPIiHHKKAJSKAKLKBYTGTMMKI pur 
KKDGKU>U lDIJUtnAIiQENPUjOPKMDACSTBKRXGDFBKI FA 
YYDVSK!IGALBGP\EVD6FVKCMMBLVQPSZSGVDtiDKFRBXLL 
RHCDVNRDGKlQRSBIALCbGIiRIHP 



TQlMSRRQLKRWVWSSQQVQARGRNVPAPRLGKiAHGLBftSSKD 

SPGSLD<3lWKDAQKPQSAJTCX3GRKrRVYATSSRRAPPSEGTRR 

GGAARPKKTABBGPPAAPGSLRHSGPLGPHACPTASPKPQVTSA 

hssowgieplyikaepaspdspkgssbtbtbppvaiapg\pap 
TRCLPC 3HKEHEDGBGAGPGEQGGGKLVLSSLPKRLCLVCC33V7^S 
GYHYGVAS CEACKAPPRRT IQGS IBYS CPASNBCKITKRRRKAC 
0AOTPTKIXRVt3CJCBGVRl«DRVEGGRQKYKKRPKVLlPLPFPGP 
PPAGPIAVAGGPRJOAAPVHALVSHLLVVBPBKLYAMPDPAGPD 
GHLPAVATLCDLPDRRIWT I SWAKS IPGFSSLSLSDQMSVLQS 
VWMBVLVDGVAQESLTIiQDBIAPAHYLVLDEBGARPAGLGBLGV 
AAIljQLVRRMJAIiRLERKBYVLLKALALANSDSVm KDHPRLWS 
SCBKLTiHKACJJryBAGRRGPGQGAHRRRAGRLLLTLPLLR<?IAQ 



6128 



2511 



843 









KVIJUfPYGVKLEGKVPMHKLPIiBMLRAKMD 


6129 


17^4 


771 


ARFARSAHEGKMPKKKTCSIRKKAENRRBRKKQLRASRSTIDLAK 

HPCNASMBCDKCQRRQKNRAFCYPt3?SVQKLPXCAQCGKTKCMM 

XSSIKVTKRAGVYSTCIiAMVGAICX>FCKAHVQIGRKCLSTHACA 

cpltdaec\veckrgvwdhggrx pscs pchnflckddqfkhqas 

CQVl»SABTPKCVSCNlUiGQHSCLRCKACPCDDHTRSKVFBCQBKG 

JCQPPCPKCGRETQBTKDLSMSTRSLKFGRQTGC3KBGDGASGYDA 

YWKNLSSDXYGOTSYHDBBEDBYEAKDDRKKEDBGRKDSDrXKSS 

dlftnlklgrtyasgyahybbqkn 
577 
Amino acid segment containing signal peptide 
(A=Alanine, C=Cysteine, D=Aspartic Acid, E= 
Glutamic Acid, F=Phenylalanine, G=Glycine, 
H=Histidine, I=Isoleucine, K=Lysine, 
L=Leucine, M=Methionine, N=Asparagine, 
P=Proline, Q=Glutamine, R=Arginine, 
S=Serine, T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, X=Unknown, *=Stop 
Codon, /=possible nucleotide deletion, 
\=possible nucleotide insertion) 
DFYRKBIBV\D3SPSVAGXSWTQQGTEQP\ASMRDLYIKKGQGC 
ILVY3I»VNQQSPQ\DIKPMHDQIIKVKVSKKVPVT\LWa?\SVD 
LESKRBVSSSBCmAIAKBKGCPFPffiTSAXSKTMVDKLFABIVEQ 
MNIAAQPDHDDPCCSACNIQ 

SSPRBKTSDSSHRPSHHCPLFLRLVGLSPPSYIiCVPPSRPVPGS 
PRSLSAMaLLPIAPGRIfiRGSPHHLPSCSPAIiIiiLVIjGGC3jGVF 
GVAAGTBRPNVVIiIiIiTDDQDBVIXX3frP2iKKTKAIaXGBMGMTFS 
3AYVPSALCCPSRAS ILJGBYPHNHHVVNm'LEGKCSS KSWQKI 
QB PNTFPAI LHSItCGYQTFP\AGKriJfBYGAPOAGGLKHVPIiGW 
SYWYAIiBKHSKYmm^IBGSAEKKGHHYSVDYLTDVLANVSL 

dfldyksnfbpffmmtatp\aphspwtaapqyqkafqnvpafrn 

KHFNIHGTIJJCHSiLIHOAKrPMTHSSIQFLDHAPRKBWQTIiSVD 

DLVKKLVKRLBPTGKLHHTYI PY TSDHGYHTGQFShPIDKRQLY 

HPDIKVPLLVRGPGXKPNQTSKMLVT^NXDlXJPTTUDlAGYtJXifK 

TQHDGHSIJJIlJGASHLOTaSPVl>VBTQ GB GRH V TDPTCPSItS 

PGV3(X^PPI>CVCBDATNHrtACVRTHSAIMIU}YCBFM)(JSVPV 

BVINLTAbPOQZTNZAICTIDPELLGRMNYRIKMLQSCSGPTCRT 

PGVFDPGYRJTJPRLHFSNRGSVRTRHPSKHLL 

AAGLLPPGLVPSDPfifiTBNLLPPGIQGPPPALSRPIiFSCVKSGW 

AMBAMBPBPLYDLLQLPKBVBPPAKBBLGKOGKXKYIiPPTSRKD 

V-KP BKlJPKPA\ Vli4BW INATUiPBIil WEST iKKDMFDGLILHHL 

PQRI*AALKLBABDIALTAT3QKHKLTWLSAVHRS\CESIRSGRP 

SGA/WESlFNKDLLSTLHLLVAIiAXRPQPDLSLPTNVOVKVm 

BSTKSGLKSKBI»VSQI»TEYSTOKDB PPXDVPDBLFKLAPBKVNA 

VKBATVNFVNOKLDRLGLSVQNLDTQPADGVI LLLLIGQLEG PP 

LHLKEFYLTPNS PABMLHNVTLALELL/ IGRGPAQLPC/LALK/ 

TIVNXDAKSTI^VLYGLFCKHTOKAHRDiOTHGAPN 

PVKGSMAim}I>FMBCBBBBLBPHQKISDVIBDSVVE0YNSVDKT 

TTVSVSQQPVSAPVPIAAHASVAGKLSTSTTVSSSGAtB^SDSTK 

KTLVTLIANNNAGNPLVQQGGOPLIXTONPAPGIjGTMVTQJPVLR 

PVOVMQHAMHVTSSPVASQPI PI TTO GF P V RNVRPVQHAMNQVG 

IVU?VQQGQTVRPITLVPAPGTQPVXPTVGVPQVPSQMTPVRPG 

STMPVRFTTNTFTTVT PATLT lKSTVP QS QS CXyrKSTPST ST TP 

TATQPTSLGQZAVQSPGQSSQTI^PXLAPSPPSPPAVSXASPVT 

VKRPGVTGKNSNHVAKLVHTLHTIPSLGQSPGPVVVSHKS9AH\ 

OSQjrrSGPKSSWKVTSSIPVFDLQDGGRKICPRCHAQPRVTBAL 

HGHMCYCCPBMVKYQKKGKSLDSKP3VPSAAKPPS PKKIAPVAS 

/THPSSTPIPALSPPY/TKVPBPNBHVGDAVQTKKEMLVDDFYT 

GHDGGKVAQLTTTPPKVATS FllCPHCTXfiliKNHIRFMNHMiCHHVE 

LDQQNGBVDOTICQHCYRQPSTPFOIiQCHLENVHSPYSS'PTKC 

VHFHMIHKDTWEiLCPYCI^VTKN(2DVPQQHYMRHQKR\HVYH\ 
CNKCffVQPLFAKDXIBHKLQHHKTFRKPKQLECHJCPGTKVTIRA 
SRGQPRTVPV6SNDTPPSALQBAAPLT3SHDPLPVFLTPPVQRS 
IQKRAVRKMSVMGRQTCLBCSPBIPDFPIJHKPTYVHCSIjCRYST 
CCSRAYAHHHXNBHVPRKSPKYIiALFKNSVSGIXIjACTSCTPVT 
SVGnAMAKHLVPNPSHRSSSILPBGLTHXAHSRHGQTRDRVHDR 
HVKNMTPP PSPPTHKAATVRSAGATPABPBELLTPLAPALPSPA 
STAXPPPTPTHPQAIALPPIATEGABC3JIVDDQDaGSPVTQBPB 
LASGGGGSGGVGKKBOI»SVKKIiBWIiPALCCNTBQAAraFRNPQ 
RRXRRWXiRRFOASQGKNLBGKYLSFBABBKLAEVA^LTQRBQQLP 
VNKBTLPOKATKICfRSIiBQGPKI S YBSAVRFTUiRHHLTPHARfiA 
VAHTLPKDVAENAGLPIDFVQRQrHNQpLPLSMIVAIDBISLFL 
OTBVbSSDDIUCKNAIiQTVGTCBPWCDVVLAILADG'rVLPTLVFY 
RGQHDQPANMPDSILLEAKBSGYSDDBIMKLWSTRVWQKHTAOQ 
HS kcmlvmdchrihlsebvlamlsasstlpavvpagcsski QPL 
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t ' « .• \A . .*.vV * * » ;'*i !***« »*‘l 

Amino acid segment containing signal peptide 
(A=Alanine, C=Cysteine, D=Aspartic Acid, E= 
Glutamic Acid, F=Phenylalanine, G=Glycine, 
H=Histidine, I=Isoleucine, K=Lysine, 
L=Leucine, M=Methionine, N=Asparagine, 
P=Proline, Q=Glutamine, R=Arginine, 
S=Serine, T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, X=Unknown, *=Stop 
Codon, /=possible nucleotide deletion, 
\=possible nucleotide insertion) 

IJVCIKRTVKNPIilKKHKEOARKNADTACnSDVliLiQLVLVWLGBV 

I/JVIGDCPELVORSFLVASVIjPGPDGNINSPTRNADMQKELIAS 

LKEQUttiSGKHSBSSTPRPRSSPKBTIBPKSLHQIiBnBGBSBTBS 

FYGPESADLD1MBI 

PVHGSMAim>LPMKCBBKBLBPtfQKIfiDVlBDSVVKDYHSVDKT 

TTVSVSQQPV3APVPIAAriASVAGHIiST3TrVSSSGAG»SD3TK 

K31»VTLiANNHAfflJPLVQQG5QPLIIjTXH?PAPGbGTMVTQPVLR 

PVQVT^NAmiVTSSPVASQPIFITTQGFPVPNVRPVQNAMHQVG 

IVLHVQQGQTVRP rTLVPAPGTQFVKPTVGVPOVFSQKTPVRPG 

STMPVRPTniTFTTVIPATLTIHSTVPQSQSQQTKSTPSTSTTP 

TATQPTSLGQIiAVQSPGQSNQTTHPKIAPSPPSPPAVSIASPVT 

VKRPGVTGKNSNKVAKLVNTLSTIPSIiGQSPGPVVVSNNS3AH\ 

GSQRTSGPKSSMKVTSS I PVFDLQDGGRKI CPRCNAQFRVTBAb 

RGHMCYCCPBKVBYQKKOKSLDflBPSVPaAAKPPSPSKTAPVAS 

/TOPSSTPIPAMPPY/TKVPKPREHVCTIAVQTKLIMLVDDFYY 

GRXXKSTCVAQLTlfPPKVATS PTlCPHCTKRIiKI^RF^ 

LD0QHCBVIX2ffTIC(fflCYRQPSTPPQIiQC3ILHIIVHSPYBSTTKC 

kicewafeskplpxqhmkuthkpgewpyvoqwoqyrsslysbvd 

\raPRMITffiiynUJIJjCPYCTJCVPHNl2ZAFQQHYMRHQKR\NVYH\ 
CKKCSVQFIiFAKDKIEHXLQHHKrFRKPKOl^GliKPGTKVTIRA 
SRGOPRTVFVSSHDTPPSAIXJKAAPI/PSSMDPLFVFIjYPPVQRS 
IQKRAVRKKSVMQRQrrCLBCSPBIFDFPNHFPTYVBCSLCRYST 
CCSRAYANHM INNHVPRKS PKYLAUPKNSVSG I KLACTS CTFVT 
SVGimMAKHLVPNPSH^SSIIiPIKaTWIAHSRHGQTRDRVRDR 
NVKHMYPPPSFPTNKAATVKSAffiVTPAB P8ELLTPLAPALPSPA 
STATPPPTPTHPQAIJU^PIATEGAECLNVnDQDKGSPVTQKPB 
IiASGGGGSGGVGKK8QI*SVKKIiRWLPALCQr/BQAAEHPRNPQ 
RRIRRWLRRFQASQGEMIiEGKYLSFBAHSKIAEHVliTQRBQQliP 
VNKBTLFQKATKIGRSlkEGGPKISYEHAVRFMLRHHLTPHARRA 
VAHTbPKDV7\BHAGIiPIDFVQRCIHNQDLPLSM IVAI DKI SbFL 
DTEVLSSDDI^EKALQTVGTGEPWCIJVVIAILADGTVLPTLVPY 
RGQMDQPANMPDS IIiIiKAKRSGTSDDBIMEEJiISTRVNQKHTACQ 
RSKGMliVMIXIIRTHJjSKBVLflHLSASSTLPAWPAGCSSKIOPl' 
DVCIKRTVKNFliiKiWKEQARa!AmACDSmn^L^VWI^GKV 
LGVIGPCPELVORSFLVASVliPGPDOilHSPTKNAIlMQBELlAS 
LgBQIJCLS GEHSBSSTPRPBSSPREriBPgSLHQI»PSGBSETB3 

fygfbeadldlmbi 

FVHGSMADTDLPI^CBEBBIiBPt^KISDVIBDSVVKDYNSVDKT 
TTVSVSQQPVSAPVPIAAHASVAGHLSTSTTV3SSGAQN5DSTK 
KTLVTIJCANHNAGFPLVQQGGGPLILTQHPAPGIXnMVTOFVLR 
PVQVMQHANHVTSS PVASQP1 FI1 TQ GPPV HHVRPVQHAHNQVG 
IVLNVQQGQTVRP ITLVPAPGTQ WKPTVGVPQVFSQHTPVRPG 
STMPVRPTTOTPTTVIPATI.TIfiSTVPQS0SamCSTP3TSTrP 
TATQPTSLGQLAVQSPGQSWQnKPKIiAPS FPSPPAV3 IAS PVT 
VKRPGVTGKNSNKYAKLVHTLNTIPSLGQSPGPVVVSHNSSAH\ 
GSQRTSGPBSSMKVTSSIPVFDLQXIGGMaCPRCNRQPRVTBAL 
RGHMCYCCPEMVEYQKKGKSUJSBPSVPSAAKP PSPBKXAPVAS 
/THPSSTPI PALSPP Y/TKVP KPWKM VGIl AVQTKLlMbVDDPYY 
GRIX5GKVAQLTNPPKVATSFRCPHCTKRLKNNIRFHNHMKHHVB 
UXXiJTCBVDCTTlCQHCYRQ PSTPFQIjQCHIiENVHS FYBSTTKC 
KICEWAPBSEPIiFLQHMKDTHKPGEMFYVOQVCQYRSSLYSBVD 
VHFRHIHKD'TRHLLCPYCIjKVFKHCaiAPQOHYMRHQKR\NVYH\ 
OnCCRVQFI*PAKDKIEHKM2HHItTFRKPKQIiEGIJCPGTKVTIRA 
SRGQPRTVPVSSHDTPPSAIiQBAAPLTSSICDPLFVFLYPPVURS 
IQKHAVRKMSVKGRQTCLECSFBIPDFPWHFPTYVHCSIjCRYST 
CCSRAYANHMINNHVPRKSPKYIiALPKNSVSGIKLACTSCTPVT 



SVuUAMAKHLVFNPSHRSSS lltPRGLTHIAHSRHGQTRDRVUDR 
HVKNMYPPPSPPTHKAATVKSAGATPAEPBBLliTPIAPAIiPSPA 
STATPPPTPTHPQAIiALPPIATEGABC mV DDODBGSPVTQHPB 
LASGGGGSGGV&QCEQIjSVKKLKVVLFALCCMTEQAAEBPRHPQ 
RRrRRHLRRF^SQGENLEGKYLSFKAKKKlABHVLTQREQQXJ > 
VHKSTLPQKATKIGRSIjEGGPICXSYBHAVRPMLRHHLTPHARRA 
VAHTLPKDVAENAGLPIDFVQRQIHNQDLPLSHIVAl DEISLFL 
DTBVLSSDDRKBN7dXyrVGTGBPWCDWtAIIiADGTVLPTIjVFY 
RQQHDQPANMPDS ILI^KAKESGYSDCffilMELWSTKVVfQKHXACQ 
R3K3S^IiVMDCHRTHLSEBVLAHl>SASSTIjPAVVPAGCSSICrQPL 
DVCI KRTVKNPIWKKWKEQARKMADTACDSDVIJiQLiVLVWLGHV 
LGVIGDCPKLVQRSFLVASVLPGPDGNIHSPTRNADMQEEIjIAS 
L&EQLKlkSCBHSBSSTPRPRSS PHETIBPBSLHQLPBGB9 STBS 
FYGFBBADIiOLHBI 



KRKRKRSVUEOBHPDDGVROSPPEDYHIjGQVAS 
SbPRGSHHSReOTGRIAS tiFSSLKPQIQPVTVPVPK\BSALASA 
XJLEEQSlBQKQGQKRKZiSQPGVKVADRECILDDTBDTVySQRXKlO 
IKQBBKRLKHKRIVFVGNL PVTCN CXKLKS 7FKBYSQX8SVRFR 
SLIPAKGTLSKKLAAIKRKXHTOQKNIHAYVVyKBBSAATOALK 
RNGAQXADGFRIHVDIASSTSSRDXRSVFV(3tL»PYXVBRSAIBK 
HFLDOG$XMAV]UVIU>XMTGXGKGPGYV]jPBI?TDSVHIA 1«KUIH 
SBLMGRKLRVMRGVNKKXKKCXfifSHPRIiXNVSKPKQaiiNPTSKT 
ABGHPKSLP1CTKAVLLKTKKKGQKKSGRPKKQRKQK 



RALRKRROGPGEtKGALGSGPGPQRRPGRVPBERPAPPKKRKHPG 
MHNMLrVAMCIA\LJjGLPGKAQBIiQGHVS\lIIiAGSQIGDLAKX 
TLSfQG VLFQLTLDBAGRGHS PSPHGAALTAPKQGQSLMAKALES 
LSCPKBMAPSHCAEHKDQPLQ^SQYRQUCrREDYOALNKDlKAG 
LOHAGLREAGGIPYPSVPPPAYBDIARNIHSSaiPGPGAWLRW 
LBKPPGHDHFSAQQLATELGTFFQSBEMYRVDUY1/3RQAVAQII* 
PPRDQHRKALDCJLWNRHHVERVBI IMKBTVDABGRTSFYKBYGV 
IRIWIiCWHLTBVljTbVAMBLPHNVSSAKAVLRHHI^JVFOALRGI. 
OfiGSAWGQYQS YS BQVRRKLQKPDSFHSLTPTFAGVIjVHXDNL 
RHEGVPFII^GKAIiDKRVGYARILFKNQACCVQSKKHHAAAQS 
QCLPRQIiVFHIGHGDLGS PAVLVSRKLFRP3LPSSWKBHEGPPG 
IiRLFGS PI*SDYYAYS PVRKKDAHSVLLSHI PBGRKNV PITJTKNL 
LASWNFWrPLLBSLAHKAPRLYPGaAKHGRLLDPBPQSGRLPFS 
QCX5PBQLVPGPGPGP«PSDPQVLRAKyRB3SLV3AH3BKLlSKI* 
ANDIRATAVRAVRIi FGQFITLAbSQGSS FVALFQQI*ATAHYG PPW 
AHTHLWLVDBRCVPLaDP RSBHXMAHI^HVRI PYYNIH\AM 
PVHLQQRl«CABBDQGAHIYARKISAIiGANS5 PDIjVU/a'EADGH 
TASLPPQS PTCIiDGEQLWLTTS PSQ PHRRKSLS LP^ItfRAHKV 
AVLVKOTMKRBITTLVSRVGHBPECKHP ISGVLPHSGQLVWYNDY 
DAPLG 



BFSKLTDRWQHAVQGn/RQRKGDVDG&VRQtft^FTTSVKNIiPftFl* 
TDTSHL2>SAVKGQBRPSLY<rrRSLIHBLXNKBIHFQRRRTTCAl* 
T * .UAfTRBl ,t .7 .T TnT .KTKRSVCntB T SQXiQDSVKDMBPQXiABMIKQ 
FQ3TVBTWDQCBKKI KKLKS RliQVLKAOSKDPLPKLHKDLHNBK 
ELIKEI»BQ31iASWTQNLKKLQ^rMXADLTRHVI»VECfVHVU®QIB 
HUiRQffBDljCLRVAIRKQBIBOSlJIlWVVFSBXNKBLCAWLVQM 
BHXVliOXADISIKKMIKiCLQKDCMRBINLFSBHKIOIiKQHCDQL 

ikasnkshaabiddklhkxndhwqht^fdvigsrvkklkbtpapi 

OQLDKNMSm^TWLARIBSKLSKPVVYDVCDDQBIQKRLAEOQD 

I^RDIKQlIRAfiVRSVFNiaJVLLHDSDACANKTBCDSIQQTTRS 
LDHRWRNICAMSMKRRMia BETtmLWQXFLDDYSRFBDS LKSAB 
R7AACPNSSBVLYTSAKKKLKP P^AFQRQIHER T.TQLELXIJKQY 
RRlARBNKTDTASRLKWVHEGHQRWDHLQBRVTAVLRItLRHPT 
NQRSBFBGTOBSII«VULTEI4IUiQLTOVEHFSBSIlADDKnRQljNG
FQQBITUJTNKIDQI»IVFGBQLIQKSBP\LDAVIJBDSI£BLHR 
YOQBVPGRVSRPHRKLTSCTPGLgDKKKASEHgrDMBPPRBIQT 
DSKRKS6BSBBPSSPQSLCHLVAPGHBB5GCBTW3VDS\IPIiB 
WDHTGRRGGPSSSH\BEDBKAOYY\SAI*SGKSXSDGUiJWHVPDS 
PSCPBHHYKQMBGDfiHVPPVPPASSTPYKPPTOKLLLPPGTDGG 
KEGPRVLNGNPQQBDGGLAG ITEQOSGAFDRWBMI QAQK1*\HHK 
I*KIKQm^LHaDISATrrWLKKTBABLBmjKMAKPPSDIOBIB 
UlVKRKJBILKRFDTYKMiWS VNVSSKBFLQTBS PBSTBMJSR 
LRQLSUJJBAAQGAVDSIIRGGlJlQSI^4QOQPFI5JI^QliI*IiWIIA 
SAmUlOKAHVTDPKADPRALLBCRRKLMQLKKELVBRQPQVDH 

lqbisnsli*ikghobdcibabbkvhvt\bkklrqi*reqvsqdlh 

ALQGTQHPASPLP3PDSVDSGDQPPATSVPAPRAKQFSAVRTTB 








PSSKBDYSCTQWJIiPNARSFYPHLRYTOGPPPT 


6139 



52 


1131 


bGDWVWSRTOSVI*BTPTSVLBHARARGPCPTDSKMAI»PRl»RBCS 
TBRRPWHASSWKTL/IJdMXGGAASVXVGHPIJXrVICraiXiAGVG 
YGNTI^CIRVVYRRBSMFQFFKGMSPPLASIAVYHSVVVGVFSN 
TQHFIiSQHHC3GBPKASPPRTljSBI*LLASMVftGVVSVGLGGPVDI» 
IECIRIiQMQTPPVSGRQPRPBVQGBGSCG\BPAYQGPVHCITTIV 
RHBGIJVGLYPGASAMLIiRDVPOYCLYPI PWPL3BWITPBACTG 
TS PCAVWLAGGMAGAI S WGTATPMDVVKSRLQAIXjVYI^KYIUjV 
UX:iS03YQKEGt,KVFPRGITVXfAVRGFPMSAAMFU?YKI>SU3A 

IRODHAVTSP 


6140 


694 


136 


PPmrT^r^T.Bag«ajRPIi3VP^QIR«W<KBPVRAQPLSVtVWAP 
RCQRP/QPPAPBPSSPHAAVPEA3 ptpraaasaalelplgpapv 

SYAPQAEAEWISTPGPAC3SRLGPBTFRQRPRQPRYQDAAGPRBA 

FRQL.RKL/SPRQWIiRPDl\RTKBQ\IVEMI>VQBQl3LAtlPEAAR 

ARRiRwmjVRrro 


6141 


2" 


9B4 


AQVOPRSHPCKWPLKLROiOWAKSKBTAGXiVEfeBPTGAGGGSbS - 

ASRAPARRLVFHAQLAHGSAIURVKGFSSIQELrAQXAGAPSXS 

PSKI I*3fCTMJTP KUJMKRUX3GQIX3LEDPI PAHVKDXBKBVNVY 
KSKDSI/5LTITDKGVGYAFIKRIKDGGJVIDSVKTICVGDHIBSI 
NGBHIVGWHHYDVAKKLKBIjKKBBLFTMKLIBPKKT^FKIKIjRSK 
WSKSSGSKIGOGKATLRLRSKOPATVEBMPSBTKAKXAIEKIDD 
VL3^Y>OXHDIDlATTMFEA3KDKVNPDBFAVALDBTlJGQ3PAFP 

DBFVFDVWGVIGCAKRROIf 


6142 



116 


602 


BA^BQVCGAKCOGOAPHVEHRKEKTARIGPGVMKSKiiliRAIJiN 

T.TUtrrrimf)KNnpyT>RKi^nrVANKGrSPIjAItPllJVSBrgCVeKdNkRR 

PHVHQPILQYRSDlMHHLGBPQARMREBNMKRIGKKVRQI^fflKL 

HBKQLSHSLRAVSTDPPHHDHHDBPCXlMP 


6143 


2902 



575 


FRKRI PUKIPHH^X^WKIHHLLBTSLBSCKAHliS 1QKLLKER\Q 

\QLPVFKHKD3IVBTLKRHRWVVW3BT\GSGKSTQVPHFLIaKD 

LLLNBWKASKQTIVCTQPaRI ^VSIJUmTCDBIXSCENGPGGRN 
SIXX»TOJRHB3RACBSTRI»LYCTTGVIiLIUCL0BDGLbSHVS /HM 
PIVDEV\HKR\SVQSDFLLI ILKB1LQKRSDLHLILMSATVDSB 
KPST YPTHCPXLRISGRSYPVBVPHLBPI IBBTGPVLEKPSBYC 
QKFLEBEHBVTXNVTSKAGGIKXYQBYIPVQTGAHADLNPFYQK 
Yg$RTQW^TTtYWKPHKTwr.nTJT.RTJAYTJ)K3P0PHWIBGAVLl 
FLPGIAHIQQLYDLIjSNDHRFYSBRYKVTALHS ilstqdqaaaf 
TLPPPOVRKIVIATNIABTG ITIPDVVPVIDTGRTKKNKYHKS S 
QM3SLVKTPVSKASALQR0GKAGRVRDGFCFRMYTRERPBGFMD 
YSVPBILRVPLBKLCLHIMKaiLGSPBDPLSKALDPPQIjQVISN 

ahnllrkigacblrbpkltplgohiaalpvhvkigkmlipgai f 

GCU)PVATLAAVMTBKBPFTTPXGRKDKM)LAKSAIAMADSDHL 
TIXNAYIj^KKAROBGGYIWB rtY CRRNFLNRTSLLTIiBDVKQB 


1 






LXKLVKAAGPSSSTTSTSWB«2?HASQrrLSPQBlALLKAVLVAGL 

ydnvgki iytksvdvtbkiacivkxaqgkaqvkpssvhrdwith 
GHLLYQBK-X RYAHVYLRETTL Il'PF PVI»LPGH>IBVQHRBRIiI»S 
nX3WIYFQAPVKIAVIITCQIiEVLlI)6VIiRKKLBlIPBHSI£HDK3 
LQIITHLIETBEJ? 


1 




568 


SGPGSMSGQRVDVKWMIHKBYVOKrSLVBRYViil>ftFI.VGPYQN 
VSASGGARifGGRCSGGPVXCTTGPDLFPLVA\riGAAFVAlCVMS 
VGDRTVTLGIWDTAGSERYERMSRIYYRGAKAAIVCYDLTDSSS 
FCTftKPjryiqCTff^jrarynrrtlifrKSPIJigiffiRRBRHyDFHIW 
QDYADNIKAOLPKTSS KTGOSVDKLPQKVT^KDYVSVAAFQVMTB 

dkgvdlgqkpnpyfyscchh 




1109 


196 


GGMDLSRLBRDNTGRCBI*S^t F VPAVCKKBPCVLGVDKAGRGPVI» 

OPHVYAICYCPLPRLADLBALKVADSKTLLKSKRSRLFAKMBDT 

DFVOHMiDVLSPNLISTSHI/iHVICTNIJiSLSHDTATGLIQYAIJ) 

OGVNVTQVFVtmA^BTTOARIiQOSPPGIBVTVKAKAnALYPV 

NVSAASICAKVTffiDOAVKKWJFVKKLQDLDTOYGXSGTPHDPQD 

/tkawukhhvkpvp\gpp\qfvrp\skbtoqti \lekbabdvir 

KDSASBNQEGLRKrTSYFUJEGSQRRPRSSHRYFLERfiliKSTTS 




mm 


781 


LKRKGKBKABAQQVEAIiPGPSIiDQWHRSAOlSBKDCJPVLTbBQKS 
R/YPGHBAHDQG0 \W)ARQSIIRKVVDPBTSRTRIj 1 KGD6KVLB 
BXVTKKRHIUIINKQATRGDCliAFQMRAGLLP 


6147 


i 


2304 


GTRQL P P PS PGSGPGDS PEGPKGKAPERRRKAHGMLKLYYGLSB 

GRAAGRPAGPDPLDPTDLNGAHFDPBVYLDKLRRECPLftQLMDS 

BTDMVRQIRALDSDI'WTLVYKHYNKFI SATDTXRKMKNDPRKMB 
DBI®RLATmiAVITDPSARISATLQDRHERITKLAGVHALLRKL 
QPLFBLPSiaiTKCVBUSAYGOAVBYQGRAOAVLOQYDHLPSFRA 
IQDDCQVITRRliRQQLRQRFREGGSGAPBaAKCVEIjtiljALGBPA 
EKLCKKPLAHARGRIjSKKLRNIjBAB 1/5PSPPAPDVLKFTDHG\S 
SGPVGGIXYJVAAAYQBLPAAQGPAGAKKLRAPARQLGSRYFALV 
BRRIAQKQGGGDHSUjVRALOHPHHRLRAPGALLAAAGIADAAT 
KIVEttVARBfiliGHHUXJLRAAPLGCLTDVRQAIAAPRVAGKEOP 
GIAEIJjANVASSILSHIKASLAAVHLPTAKBVSPSNKPYFROBP 
CSQ0VRB0I»IVGPVHSM0QfIAQ8FCDSPGBKDGATPPALI»LIjL3 
RLCUJYBTATISYILYLTDBQPLVODQPPVTPVSTIiCARARBTA 

KUJ*THYVKVQGLVISQWI*RKSVETRDWLSXI»BP , RHrVHAVMKRV 

VBIXITAlDVCfVLPRiAGVAIiTORGGTVPSRGAGAAKDHMQSLPG 

GGDMCTIBiSHQASSVARASVRBPQGlJKSPRMRTKRAGBCLCTRS 

CSFSAQPYDIFAPILPVBKQRLBVTQEVIlASIiVLVLICIRPQTJJS 

C1LPLPHSTGSIHSDHVPTK 


6148


355? 


555 


VPftycrryi^TY^i^^RKinrYTYSCDIiDIMVQliBIGSliBGKRBQ 
KSYKAVLEDPMIJCFS®CiYQBTCSDLYVTCQVFABQKPLALPVaT 
SYKAPSTHWHHNBWLia*PVXYPDIiPRNAQVALTIWI) VYGPG KAV 
PVGGTTVSLPGKYGJIFRQQIHDIjKVWPMaaEOMDQKP’nCTPCRT 
SSTLSKIX2MSRXAKLTKAHRQGHMVHVDWLDRLTFREIBMXI3KS 
VKRSS»PMYLKGCPRCVKCDDKB?rGIVYYBXDGDBSSPILTSPB 
LVKVPD PQ^SLEHLVKSKHHHLPaSLBSGPSDHDIiKPYPSPRDQ 
i^kntvsyppskpptybkqdlvkbetryyltoodkaltki ltsvi w 

DI^QGAKQALALU3KHKPMDVEDSLBLLSSHrXNPTVRRYAVAR 
LRQADDKDLU'lYXJUQLVQAIiKYENFDDIKHlHiBPTKKDSQSSVS 
BHVSNSGINSABII>SSQirT/SAPFPSVSSPPP\ASKTKBVPDG 
EHLBQDLCTFLI SBAS KNSTIJUinn,YWYVrVECSDQiyPQQRDP K 

XHBMYLNVHRRPSQAIXKEDKSVRVMRSIlAAQQrrFVDRLVHLM 

KAVQRESGNRKKKHKRLQALLGDWKKHNLSOVELIPLPIiBPQVX 

IRGXIPBTATLPKSALMPAQLFFKTHXXIKyPVIPKHGDDLRQD 

QLI LQ1 ISLMDltTiIiRKHJIiDLKLTPYKVIiATSTKHGFKQP IQSV 














Predicted beginning nucleotide location corresponding to first amino acid residue of amino acid sequence





PVRBVLinrEGSIQNPPRKYAPSKNGPNGLSAKVKDTYVKSCAG 
CViiJf lI/JVGDEHLDNIdiLTKTCKLFHIDFGY ILGRDPKPLPPP 
MKLSKBMVBQCGTQSEQYQEPRKOCYTAFIiHIfiRYSNLILNLP' 
8LMVDAH1 PDIALEPDKTVKKVQDKFRLDLSDBHAVHYKQSMD 
KSVHALFAAWEQIHKPAQYVrRK 



RVDPRVHKNOTAHP I KNGKTS PAS KDQRTQKKTSVQGQVQKCUD 
gSBSDFBSPPPSPK3SgBBBQDDEBVEQGBQCTPHDDPr gP BNL 
6HRPLIMDSBDBBSBBKBSSDSDYBQAKAKTSDHS5VYRDRS68 
GPTMJfTlXjLTSAQLSSDVAVBTPKQBroVFGAVPFPAVRAQQ 
PQQKKJfflHHLPQHRPPAAQLEQBKPDVFTKAPPSKKVNVQBCHA 
VGPHAHTIPOY PKSVDVPGSTPPQPPLTSTSKSBEMHDLPGLVP 
FDErR&QQQKVKQRSLQKLSSRQRKTKQDMSKSHGKRHHGTPT 
STKKPLKPTYRTPERARRHKKVaPPDSQSSlIBPLTlSDSKENIS 
VALTDGKERGNVlJQPRBSI*IJ)PFGftKPFHSFDVLSKHPP\HQGIi 
S\DIRADHHT\VIiPGH\ PRQNSLHGS PHSAIJVLKMDD FGAVP / P 
LTKLWOSITPHOSQQSQPV\BLDPFGAAPPPSKQ 



XNNFHSDSBYR 
LNKBSSVIHAVXJCMLAQHALLIAI SSDUiAYGVVCBFDWNDCSSG 
C^GWPPMDGSEGna’TDIIXrSGIP 



DSHQQ8VSGTAASTUiK3FKATIYYQCnX3iVQQPYGVT5FY6QT 
TPPIVQ3YftQP3IX}YIQGQQI FTAHPQGVWQPARAVTl'I VAPG 
QPQPIjQPSBMVVTKNLLDliPPPSPPia’fCnVLPPNWKTARDPEG 
KlYYYH VI T R QTQWDPPTWESPQDDASLEHKAHIlDIiGTPTYPgN 
PMK\ASKKPKCAEADI3SEIJ^KKSKBVFR1CEMSQFIVQC1J5IPYR 
KPDCKVg\R I TTT EDPKHl ARKLTBGVMHKBLKYCKNPB\DIiBC 
NKNVJCHKTKBY I KKYHQKFGAVYKPKBDTBFRVTVGPGWKDGHS 
GKTDSRKRKSOGPPCSTPVSTVIjIiMnHPGBPNPADVN 



NRTHSTPSTWMGVAIipPLCSTGPWPVTRQITARTTCGAVPAKCP 

PMC/DVHKPRCQPPDCKGHGTC'/DGHOQCTGHFWRGPGCDELDC 

gpsncsqhglctktgcrcdaghtgskcsbkcplgwhgpgcqrpc 
kckhhcpcepktctcsvsrvkqcloppkatlragslspftrtam 
IJUjTLAIAFLIJUISTAAKI*S i*ll*srakrnrrlbcidyayhpuqem 

NGBPLAAKKBQPGGAHNPFKD 



GRVGARGFGRAYALLliLblCFNVGSGliHIiQVLSTRNKfKIiLPKH 
PHLVRQKRAHITAPVALLEGBDLSKKNPIAKIHSDIiftEHRaaua 
TYKXTGKGlTBPPFGIFVFBKDTGBIiNVTSIUlRBSTPPPLLTG 
YAIiDARGRHVEKPbBUtlKVLDINZaVEPVFTQDVFVGSVBSLSA 
AHTLVMKIKATDADBPNTLNSKiSYRIVSLSPAYPPVFYIjNKDT 
GKIYTTSVTIDPHBHSSYTbTVBARIXSMGBVTDKPViCQAQVQIR 
ILDVNDNIPVVEMKVLBGMVBBNQVNVHVTRIKVPDADBIGSDN 
NLANFTPASQtBGGYPHIBTDAQTHBGIVTIfIKBVDYBEMEiniD 
PSV1VAHKAAPHKSIRSKYKPTPIPIKVKVKNVKEGIHFKSSVI 
SIWSBSMDRSSKGQIIGNFQAPDEDTOIiPAHARYVlCCBIffiinffl 
igmqVTSBrKLAKLPDFKSRYVQNGTYTVKlVAISKDYPRKTI 
TGlVIiIHVKDINI3NCPTLIKPVQTICHnAKYVNVTAKDIjDGRPH 
SGPFS FSVXDKPPGMAEKNR1ARQRSTSVLLQ0SBKKLGRSB IQ 
PI»ISIQ®QGPSCPKKQVLTLTVCBVLHGS \GCRBAQHDSYVGLGP 
AAIAU4ILAFLLIiI>I*VpLLLliMCHCGKGAKB^FPIPGTXBHLHP 
HHHBGAPPEDKWPSPliPVDQGGSLVGRNGVGQOdCEATWKGSS 
SJ^rVKGQHB?SBMDGRWBEHRSLLSGRATQPTGATGAI\MTTK 
TTI1ARATGAS RDVAGAQRAAVALNBEFIiKNY FTDKAASYTEED 
KNHXAKDCLLVYSQKHTESLNflSIGCCSPIEGBLDDRPIiDDLGL 
KFKTIiABVCLGQKIDINKKIEQRQKPATBTSHNTASHSLCBQTM 
VHSEWTYSSGSSPPVPKSU2KANABKVTQBIVTBRSVSSRQAOK 
VATPLPDPKASRHVIATBTSYVTGSTMPPTTVILGP3QPQSLIV 
TKR\nOU>ASTLVDQPYANEGTTVVVTBRVIQPHGGGSNPLKGTQH 








LQDVPYVMVHKHSSPLAPSSGVQPTLAMPNlAVGQNVTVTEkVL 
APASTLQSSYOIPTBNSMTARNTTVSGfiGVPGPLPDPGLBSSGH 
SRST I TTSSTR VTKBSTVQHSYS 


1 


3660 




'KKmSj^TUilCWNFaAHPH^ SDK5HLLQHVSKLDLTDAKN” 
SDTAHXKSIBITSXLNGLQA9BS8ABpSBQBDEKGAQDMDHN3K 
BB8KIDHLTNHKNPLISyB80W335LI»BESKVHRDLVXS KPVSK 
3PgRLRKDlEVhSBPTOYEgDBVTHKRKPVKKPTTOKSSKPQlK 
RGKRK T CN’IRBCLKTGS PGJCKKBKAKNKKSLCHKNSSNS5SDSD 
KBBTKAKWTPTKXYNGLKKXSXSIiRTTGFYSSPSKVABKRIKLL 
NNSDKRUJNSRAKDRXDVWSSIQGQWPKKTLKKLFSDSDTEAAA 
SPPHPAPEBGVAHESLQTVAKERSCSPSVELBKPPPVNVBSKPI 
BBICTVeVNDRKABFPSSGSNFSA* IPLPYLHLNRLKQSL*QKGS 
RQQSSVTVSBPIiAPNQBBVRSIKSBTOSTIBVDSVAGHLQDLQS 
ERB*LASRP* CQCKLEQ* ♦SARTRT3* KSLYRSBKSKRCSGRRK 
PIKXABKKP*SH3GKQQHEGK 


6155 


8 69 


121 


HLI^KJLRGKSWITMinfVFyiiGVLAGTFPFADSSVUKBDPAPyLV 
YLKSHWPCVOWUBCPSWVLAPAHCniPin»KVMl£NFKSKVRDG 
TEQTINPIQIVEY»fflYSHSAPQDDIilLIKLAXPAMU?PKVQALN 
p\PKllVHPGTVCLLSGIiDW5QBBSGRHPDLRQHLBAPVMSDRB 
CXJK7BQGKSHHHSLCVKFVKVPSR3 FGBVAVATVIOCDKLQGIE 
VGHFMOQDVGIYTOVYKYVSTCIEtriAKDX 


6156 


5725 


3984 


GTSTVTMATKKHPSIIUiIiI/jMlXKlCI3NQirniKIj£iffnSAI>KVAV 
VMPCKSKTYAPLFCXPSPHKFCIOGIiLADTLVKDyNICIiQACSSIjH 
ALSSSLPODLLQRCVDVCRVQLVHRGTCIRQAPGKLIiKS I PLGV 
PLSNNNHTKIQEISIJaJRSHMSKAPSlITPHPQDPSD/VISPILY : 
GNBHRTGKIKWLKRI^SCX)1dJ)KRDQSTIPHI!ni»KimV^ i 

AIWBAAQFTVXSKLRTPI/JRAQDTFQTIBcnilXSLAGHTIiNPDQ : 
DVS QWTITU^EGHGNNQI^RLVXiLLQ YIiENLEKLM YNAYEG CAN j 
ALTS PPKVTRTFLYTNRQTC'C^JWLTRIRLS IMRVGLLAGQPAVT 
VBHGPDLLTEKKrTSLSQGHSLgVSXMMWEALCSLHCPEAlQG 

iavwsss ivgkhllwimsvaqoaegrfekasvkyqehlcamtgv 

DCCISSFDKSVLTLRSAQCXSASLKHCLNGBSRKSVLSKPTDSS 

PBVXNYLtantACECyiSTADNAAVQEliaNAIHDLXKSTSSTSIJH 

LKADPNYXKSLSSPBSGKFVECTBQIiBLLPGBNXNUjAGGSKBK 

IDMOOiLMM 


6157 


946 

1 


329 


MANRG PSYCHS RKVQKKI EQKYDADLBNKLVDH 1 1 LQCAED I BH 
pppgrahfqkwlmdgtvlcioinsltppgqbpipkiseskmafk 

OKBOISQFIjKAAETYGVRXTOIPQTTOIjHBSKDMAAVQRTLMAL 
CSVAVTIOHJGCYRGBPSWPHaKAaQNHIlGPSBEQLRQGQNVIGL 
QM5SNKGAS0AGNTGYGMPRQIM* daascp 


1 


ITx ~ 


1482 


LGSLIVLSifiCKVT yi^SLKHAMXRKAVDLVP ILAQNPQLAQN 
PILBGKDHNQjrTGVDPIXDHVQERKTD/SHSKSPHKKRSKSRBR 
RKSRSR3HSRDKRKDTRBKJ RBRBBVKBKDREKERBREKBEBKB 


KSRGKl»JUJKUKnKI%nlilKBill/ftJtl\X>xinwnnni\nnn»\fc*ni^i\f\Rrii\r, 

qdkbkerbkdrskbimkrkkdkksrtpprsyhashhsrsssrb 

RRHRRSRSSSRSPKTSBCTIKRXSSRS PS PRSRNKEDKKRKKERD 

hiserhkrkrstskrkssndkdgkekleknstsjlkbkehnkbpd 

SSV3KBVDDB0lAPRTKKNKIQHICaJCQI2JEENLSTKTEAV 


H 


53 


84 


AVIA^ldtiXDEARPYIJttrrHKSSTXCSHRRNQSFPPVMSLTH 

JUjn/nCYWGCAVSIiVCRFHKa^LPUirvVPyTLPVSIjPVGS cv 

X ITXnTXLTFVKDPQLBVNFYTGMDBDSDIAFQFRLHFGHPAlM 
HSCVFGIHHYBEKCYYLPFEDGKPFKLCIYVRHXBYKVMVNGQR 
rCIFAHRFPPASVKWU}VFRDlBLTRVLXSD*GRCVRITAVOEP 
DVSVSCDCTTAVQPG 


6160 


1626 


1790 


AGAKFPP*P*KVADAQPTBSEKBXYNQVNWLKDAEGILEDIiQS 

ysgaghe ireaiqhpadbklqbkas^vvplvgklkkpybpsqr 



LSAAliTOJoIjGLMjTSTPYSPTQUIjEREQAlAKQrABIIiHcTIiRPD

ELKMTNPAXQilDFSYYRHTLSRMHIKRVPABGBNKVWirBLANHIl 



EVMLBTP BYTiSRFTNBBTVS PCLRVMVGVI ILYDkVliPVGAFAK. 
TSKI [JffiCCIgVLKDQPPHSVEQXJjMMjHrrTKHLMDBYTSKOI 
KSMLQ*QLLTLVNKG 



PVSGSKSSLRRAWAS 1 LRLMIjG PRVAV3 1 LCKDGXSH* IlLKKH * 
KSHVI»BPLSSIAIjBEQCIAl*SIJmSTGKTCRA^PLSCI ISSDS 
TGQL3LLWVHBTRPRljQKVASKQAHQFBWlUUlFNYItflPKIVYS 
GGDIXniRG?fDTKVPGKFliFTSKHHTMGVCS I QSSPHHEHXLA.T 
GSYDSHIIJJ«m?NMKQPUUrrPVQGGVWaiKHHPFHHHIJJiAA 
CNHSGPKUJiOQKAMKBRQHATVI»T3HTI»PDSLVYG3U>WSWr*I»P 
RSlXJRAPSWSFTSHLGTKTADLKGASKLPTPCHECREDilDQEQH 

mpqsgmkpltbgmrkbgtoloatarperdosvhpbbadsafsi* 

iatcspyiihalhlhhwegn 



RTXHATGRAGASPMHHIilVTKRLABAHKC^IVRCUSCajBPGHWTYB 

CIGKRKYLHRPSRTRKIiKKAIiKEKSHKlJjMJOSICTTOVKHiO^K 

KKHSKSVTSS3SSSSDSSASDSSSKSKBTSTSSSSKDSDTDBSS 

SSSSSSASSTrSSSSSnSDSnfiSSSSKQ # BQHR*QL*R*TTKBB 

BKBXKLLHSYWlPGLKTm 



HX^lTBGCAVRr-HPTQI«X3 KAR1MI LL3VS IiORHMAFTYKFFI> 
TPVVF7FPFFPFHRKB*vm}KNPMKSRBDEKMBKHJNiaVQRAD 
MNRI^MHYLVTBOFKBAABKPRMBSG ISPSVDLSTLDBRIKI RB 
fttLR30IQB7lIAbINSLHPBIiIjiyiNRYLYFHLQOQHI»IBI*XRQR 
BTRAAIiKEnQTQIiRKQGKBSRBCLTEMBRTLRIiLAPDSPKBSPP 

ggJ^ ntQRQKVWSBVKQRVLIWf BKRBSTPKLAKLI JCT JiLBRQH 
BLDQKKVKYPKKTDLSKGVIBEPK 



PCQSTORSftMRQDKI*TGSLRRGGRClAKHQGGGVGT ILSNVLKKR 
6CI8RTAPRIJXrn^KPGVDTKIjKFTLKP3Ii3QjroPQQWYDAIiKA 
VARLSTOIPKBWRRKVWI»'niADHYIiHSIAIDHX>KTKRPTFI9BRS 
NPDDDSMOIQIVKDLHRTGCSSYCGQKABODKVVlJCRViiLAYAR 
WNKTV0YOQGPNXLAALlLBV^IKC3^EGDALKIMXYLIDKVIiPBS 
ypVHULRALSVDMAV PRDIJjEMKLPSLSQSILDTLQKTANKESGG 
GYKPPLTNVPTMQW PLTI*FATCLPNQTVl*KI>mSVPPBGSB I XL 
BVSIJ1I WAKLGHQI ECCBTADBFYSlMGRLTQEMIiENDIJUQSHB 
LHQTVYSM71PPPPPQLAELRSKYTYNI TPPPA3VKPTSVSCTHS 

kardsdbkhdpddeiiavvnavgclgppsgplaphlqkyqkq I KE 

PITEEQSLRSNNIABIaSPGATMSCRSBYHAAFNSWMRRKI'XOIK 

ALKRQYSKIKKKQQQOVEOVYIRADKGPVTSILPSQVNSSPVIK 

BL£J£BKMRMINRAARNAVIBI PGHTGGKISPVPYBDLICTKIiKS 
PTraTHlHVHKKNMPHTKSHPGOGDTVaiiII)K©ffiASKTWGI>GAA 
EAPPSGCTATRGRBGSSPEGSTRHTXBGQSPKFVPOnADVDVSA 
VQAXUGAtBIiNQRDAAABTELRVHPPCQBBCPBPPSAPBENKAT 
SKAPQQSMSKTPIPSPPPSVKPLRKSATMHLQLYOPTBRTPTV 
HPPQMSR3 P3 KPGGCHJSOP * KMVFSSGTMLSRQLPGYPQBYQRli 
GQBRFG 



PC03 PGRSRMRQDKIiTGS LRRGGRCLKRQGGGVGT1 I*SNVUCKR 
SCISRTAPRLLCTI^PGVDrTKLKPTLEPSLGQNGPQQWYElAIiKA 
VARLSTGXPKSHRIIKVKLTLADHYLHSIAIDWDKTMRFTFNBRS 
UPDDDSMGIQ r\na)LHRTGCSS YCJGQBAEQDRVVIiKRVLIiAXAR 
HHKTVGYCQGFKIIiAALZ LEVMEGNBGDAZiSIHlYLlDBCVId’BS 
YPVHNLRALSVD^VPRDLI^MKliPBLSQHljDTUiKTAflnCiiSGG 
GYBPPLTHVFTHQWFLTLFAXCLPHQfJVIJCIWDSVFFBGSBXIL 
RVSLAIHAKLGEQIBCCKTAIJBPYSTMGRLTQEHLENDIJX2SHB 
U4QTVYa«APPPPPQlAEIJlBKYTYNrtPFPMVKPTSVSGRBS 

kardsdebhdpddhqawnavgclgppsgfiapblqkyqkqikb 



PNEBQSLKSliHIAKLSPGAIRSCRSBYHAAPNSKMMKRMTTDIN 
ALKRQYSRI KKKQQQQVHQVYIRADKGPVTSI LPSQVNSS PVXN 
HIXLGKro!KMI33iyjVKHAVIHXPGHTGGKlSPVPYHDLXTKINS 
PWnnRVHXKHMPRTKSHPGCGDTVGLIDEQNKASKTNGLGAA 

VQAXLGAL^USQRDAAABTHLRVHPPCQHHCPBPPSAPBBHKAT 
SKAPQGSNSKTPIPSPFPSVKPLRKSATAENIjGLYGPTKRTPTV 
HPPQMSRSPSKPGGGNSGP * KHVFSSGTMLSRQLPGYPQKYQRN 
GGKRFG 



HKLHRTVAMAOAEWKSLBKCLEKHLPLPDLQEViCRVLYGKBIJlK 

ldi^rkafeaasredfklqgyapbaakeqlrrprxvhvolvqnr 

IPI^AHAPVAEQVSALHHHIKAIVBVAAMOGVHI ICFQBAWTMP 
FAFCTRBKLPWTBFABaABDGPTTRFCQKLAKNHDMVVVSPIbB 
RDSKHUDVLWBrTAWISNSGSWLGKTRXNHIPRYGDFNKSmni 
EGNI/niPVFQTQFGRJAViJICYGRHHPIjKSfLMYSINGABIIFNP 
SAXIGAL 6 BSLWPIKARMAAIANHCFTCAINRVGTKHFPNRFT 3 
GDGKKAHQDFGYFYGSSYVAAPDSSSTPGLSESHDGIXVAKLDb 
NLCQQVNDVWHFKMTGRYBMYARB 1 ABAVRSKYS PTIVKE* PAS 
VPALG 



YaiVTGP3LCAGDKQPKKQBKNPVLVSPKFVI>KALCACKKYL5N 
LAfiKDIDKDLSAPLYLTFEGWSLFIiQRYYQVVHBGAKLRHLDTQ 
VQRCE&ILQQI£AVVPQIDMEG37RNIWIVKPGA]LSRGRGlf<CKD 
HLEEMLKLVNGFPVVMKDG KWWQKY IKRPLLI FGTKFDLRQWF 
LVTDWNPLTWFYRDSYIRFSTQPFSLKNLDK*APLYLTPEGWS 

lflqryyqwhbgaelrhldtqvqrcedilqqloawpqidmeg 

DHNIH rVKPGAKSRGRGIMCHDHLBKMLKLVNGHPVVMKDG KWV 
VQKYI BRPLLIFGTKFDLRQWFLVTDHNPLTVWFYRDSY IRFST 
QPPSLKNLDK 



VWPVPSVSAMPPKKQAQAGGSKKAEQKKKKKI IBDKTFGLKNKK 
(3UCQQKFIKAVTHQVKFG0 Q HPRQVAQSEAEKKLKKDDKKKHLQ 
KLNBLFKPVVAAQIQSKGADPICSVVCAFFKQGQCTKCDiCCKFSH 
DLTLHRKCKKRSVYinAHDHELBKDTHDNWDE KXLEHWNKXHG 
RABKXKPKTOIVCKHFLBAI KNNKYGWFHVCPGGGD I CMYRHAL 
PPGFVLKXKKKKKJCKKDBISL*DLIERRRSAI/5PNVTKITLESP 

lahkkrkrqbkidklbqdmerrkadfkagkalvisgjibvfbfep 

BTGITVASIJBHFSTyTSDXDBNKIjSBASGGRABIKJKRSDLHHDN 

ERBBTHHGAinAVPVDKNIiFTGEDLDHI^EKIJiTIJlLKE 



APAAAHAKRPKDLNLPNAVITRI 1 KKALPDGVNISKEAJiSAISR 
AASVFVLYAlSCANNFAMKGKRKFIJtASDVLSAMKBMKFQRPVT 
PIJCEALEAYnREQKOKKKASKQKKKDKDKKTDSKEQPKSRDBDN 
DBDKBRLBSEBQNBBBEVDN* KnRETV7VPHKVPLEMRRATCFCB 
AFPCHAB 



STKVilLFHTCRUkGCTVFITGASI^IGKAlAIiKAAXDQANIVlA 

AKTAQPHPKLLCTIYTAABKIKAVGGXALPCIVDVRDEQQISAA 

VRKAIXKFGGIDILVNNASAISLTMTUyTPTKRU)LHHNVNTRG 

TTltASKAClPYIiKXSKVAHlPNXSPPliBLITPVWFKQHGGRH^VV 

G^GDGLCLICFELNLCMSDVITICT 



HFMQS 0 VBLDCDIBPCBHTKF PPTLPLSTTVIVCSCHPVAIAST 

maeafskttskbdqsiqbpkkaksmtaqkqkk*glrgsrrrkan 

SGGDI FGDS FAAYFPRVIiKQVHQALSLSOEAVSVMDSMVRDIIiD 

riatbagmlahysxcvtitsrdirmavclllpgkngiclaesqgt 

NATLRYTKSK 



GLCRAOGAHRFSRTHVBAAIJCMLRRKARLRRBYLYRKARBEAQR 

SAQHRKRSLRRALHHNRIalPTRLRREALALQGSLEFDDAGOSGV 

TSHVDDKYRWAGVKDPKVNITTSRDPSSRLKMFAKKLKIiVFPGA 












QRMNRGRHBVGALVRACKANGVTDLlArVHKHRSTPVGLIVSHLP 
FGPTAYPTbCNVVMRHDIPDUrmSKAKPHLITHGFSSRLGKRV 
SDILHYLPPVPKDDSHRVITEAHUDDYISPHHHVyiGCroHHlJVB 
LTEVGPRFKIiKLYNI RliJTLHQBAlADVKWIOniPYTIITT^RKKVP 
IkSTB+AAPRPI/3QLI» 




3 


288 


SVDHRBVQVXiSQ^PLTPHOAVLRGKRPYMCVKCGKCFGHSSHL 

lqhqrihtgkkpyvcsvogkapsossvlskhrtihtgskpybcn 

ECGKAFRVSSDlAQHRKlHTGBKPHBtZIiECRKAFTQLSHLIQHO 

HIHTGKRPYVCPLOGKAPHHSTVLHSHQRVHrGKKPHRCNECGK 

tfsvkrtllqhqrihtgbkpytcsbcgkapsdbsvliqhhhvht 

GEKPYECSBOGKTPSHRSTLMNHKRIHTKKKPYACYBCaKAFVO 
HSKLIQHQKVHRKL* PTCVLSVGSAIAGVPTSPSISVSTLJgRSP 
MCAVYVGRPSARAQSLVNTOQPTQVRSPHSVHSVKKPLK 




1060 


859 


PRPPGKHWTVAGIiQrPGIiPGIRHSVGWAVLGQLftRRUJVABSWT “ 
HDHHC?UU>lAXAPIXn)AQLVtiLSPRRI#!HAHGRSVARAAELFGl» 
TASBVYLVHDBU)KPLGRliALKLGG3ARGHHGVRSCISCU9SKA 
HPRLR VGJ IGRPAHPHAVQAHVLGCPSP ABQKLLPLLLDHATDU 
LDHIRfiKSOGPSLGP*H*WFSXKA 


6175 


2204 


334 


RYFHADPRSRSGQPRAEGIjGAFABGPIjRAMAAPVKGNRKQSTEG 
DAIiDPPAS PKPAGKQNGIQNPISIjKDSPEAGGBRBKBQKRZBEQ 
AFLVSLYKPHKERHTPIEKVPHDGPKDIHLIfKrYKRVKICbGAYK 
I»VTGRRLHKHVYNBLGGSPGSTSGATCTRHHY*RIiVLPTVKHLX 
GBDDKPLPTSKPRKQYKMAKBHIIGDDGATSRPKICAKHERRMDOM 

hpgktradaadpaplpsqeppshsteqoglasgssvspvgasgc 

PEAYKRLLSSPYOQGTHGIMSPIAKKKIiLRQVSKVEALGCQBEG 
C31HGAHPQASPAVHLPESPQSPKDLTEN3RHRLTPQBGLQAPGG 
SLREKAQAGPCPAAPI PKOCFYTHPTBVLKPVSQHPRDFPS RLK 
DGVUXJPPGKSGLSVKEPQLVHGGDtANRPSAFHRQGSHEGI I»YP 
KPKACWVSPMAKVPAESPTLPPTFPSSPGLGSKRSIiERRGAAHS 
GKRLRAVSPFLKEADAKKOGAKPAGSGLVSCLrOPAIGPVPPEA 
YIK5TMLHCPI^PTGTPGPbBGQAALP PSPLVI PAPPAHFLATAG 
PSPHAAGI2GIFPPTSFDSALRHHLCPASSAWHAPFOTTYAAPHF 
PHLHTKL 


6176 


~ lolo 


402 


PLSALRAMABVHVIGQIIGASGPSKSSliFCKKGIlfrGAAWKIilkS " 
GVREGQTQyDTPQIGDMAYWSHPIDLHFATKEb(X3WPRLHFQVW 
SQDSPGRCQbAGYGPCHVPSS PGTHQUlCPTSfRPLGSWRBQLAR 
AFVGGGPQLIiHGDTITSGADRYRIiHTAAGGTVHLKIGLLXaHNFD 

rygvec*gtlpptsppstprtpsdggghrsgqbhri. 


6177 


1400 


992 






1027 


254 


STQRGGIKGVARAAS LVGRR RAGTQ4AUiLCLVCLTAAIiAHG CL 

hchsnfskkfsfyrhhvnpkswwvgdi pvsgalltdwsdotmkb 
lhlai pakitrkkldqvaiavyqwidqlyqgkmyfpgyfpnklr 

jnFRBQVRLIQHAI IBSRIDCQHRCGIFQYBTISCNWCTDSHVA 
COTYNCKSSAGWKSAVQGLLHY innhhkqdtsmrprs srfswpo 
TBRAAPAPLVLPALR CUSPPHLANbSLBDAA 4 CLKQH 




806 


276 


BGBTREMAGNLLSGAGRRLWDWVPbACRSPSlGVPRi7fGIRl.TL 
PPPKWDRWNEKRAMFGVYDNIG1LC3»FEKHPKBLIRGPIWLRG 
lfnOGNBIXJRCIRKMMVGSRMPADDLBHLHKRIRYLYKHFNRHGK 
FR*KRXLRTSKKAHLS PWRRKTVLPPVRKRLCI FS VI KWGPFG I 








GNHXKSYVIiGVKGADIid)SGDLVHHWVDNVLSEyVMSEIinVYV 
TDCHV^TSAFSKAGMCLRCSACAIiNSVVOSVLSKRTIiQARSMHB 
VI ELLNVCKDLAGSTGIAKBTFGSLEBTSPPPCHN3VTDSLLI.V 

hbrybqicefysrakkmnliqslnkhllshlaailtpvkqavxb 

LSNKSQPTLQLVLPTYVRLBKLFTAKAHDAGXVSKLaiLFLBAL 

KEHFKVflPAHKVANXLDPQQKLRpVpPTQHEBIIGKVCBLINEV 

KBSHABBADFBPAAHKPRSAAVKSPAAQKDDSX/5KNBVYDYIX)B 

PLPQATPDIjFQYWSCVTQKHTRLRKLAFWLLAVPAVGARSGCVy 

NCSQAX*X*IERRRLLSPEDHNKLMPLKSHML 


6181


169 


1032 


|| j j * > TpT -I7wK*Sp? n : 4 %■ j rJ TTTTj u ri y 4 ? i r •] •) • V t : 1 : « J nX* fl * 


6182 


1769 


1224 


AS* XDYQlMTLLKRFQLTGENTKLRyLTCSLXBDHAAAYFPDCI 
VHPPGSGVNTFGKLGCDLDHPIiDUSTRNLSAHKISQJFUlBPO 
VKNVPSBaiA7X)iai^VI/lBCU>UPaPGCVaVQKIIiMARCPLVR 
FSRQASGPQCDLTTKNRIALTSSBLLYIYGAIjDSRVRALVFSVR 
CSCARAHSLTSSIPGAHITNFSLTHMVTFFLQRRSPPXLPTLDSI* 
KTIiADASDKCVXEGRNCTFVRDLSRI KPSONTBTLBLLLKEFFB 
YFGNFAFDKNSXNXRQGREQNKPDSSPLYXC2I7PFKTSLHISKNV 
SQ3QLQKFVDIJ\RESAWI LQQEDTDRPSISSirRPWGLVS LLLPS 
APNRKSPTKKKSNKFAIKTVKNI*LKSLXGNRTERPTKTSGKRTI 
STQT 


6183 


111B 


452 


fflJJRYIKSPGSGSBTPAPPSHUiLYLIiHPQSTRTMGCCGCSRGC 

gbgcqgcgsscggogsgcggcgsgrggcgsgcggcsssoggcgs 

RCYVPVCCCKPVCSHVPACSCTSCG3CGGSKGGCGSCGGSKGGC 



GSOGCSQSSCCKPCCCSSGC3GSSCCQ3SCCKPCCOOSSCXrVPVC 



CQSSCCKFCCa&NCCVPVCOQCKI * GSGPRPSGFSCLVXAFUS 



6184



VP 




IVTVRBBDGAPAVAPPGVWSRANlUlSGAGPGGSdTCJGARGARB 
BPPPPLQAVLVAD5FDRRFFP ISKDQP'EVLLPLAHVALIDYTLB 
PLTATGVQBTTVFCCWKAAQIKKHIiLKSKSfCRPTSLNVVRIITS 
BLYRSMDVLRDVEtflKALVRSDFLLVyGDVXSNXHITRAIiBBHR 
LRRKL* KNVSVKTMI FKKSS PSHPTRCHKDNVVVAVnSTmRVL 

hfq^qglrrfafplslfqossdgvbvsydlldchxsicspqva 

QLFTDHFDyQTRDDPVRGLLVNEHILGHOIHJOJVTAKBYGARVS 
NLHMYSAVCADVI fcRWVYP LTPKANPTDSTTQSCTHSRHNI YRG 
PBVSLGSG3XLEEHVLLGSGTVIGSHCPITNSVIGPGCHIKPGD 
HVVLDQTYIJWQCWRVAAGAQIHQSIjLCDHABVKERVTIiKPRSVI* 
rSQWVGPNXTLPEGSV13I*HPPDAKBDBDDGBPSDDSGADQKK 
DKVKMKGYNPABVGAAGKGYZfWKAASDMEKEBEZiOQNLUGUCX 
NHEBBSBSESEQSMDSRRPDSRGGSPQMDDIEVFQBBVLGTLQR 
GXEBN1SCDHLVLBIN5LKYAYNISLKBVMQVLSHWLBFPLQQ 
MDSPLDSSRVCAIXLPltLKAWSPVFfiNYXKRAADHLBALAAIBD 
PPLEHBALGISMAKVIMAFYQLBIIABKTILSWFSQRDTTDKGQ 
QLRKBQQLQRFIQWLKEAKEKSSBDD 



791 



44 



PCTSCVLWAXUILPASTRKAPaAECGMISITEWQKIGVGlTGFG 
I PPiriFGTLL>YFDSVIiLAFGI3LLFI»TGliSLI 1GLRKTFWPPFQR. 
HJQiKGTSFLIiGGVVIVLLRttPIJXMFIiSTYGFFSLFKBFPPVAF 
GPUGHVCHI PPIX1M>PRRLQGTSSHV* KTEMSSLNLDHHLKGAK 

rbkwkpppqspalthsptypgppovqkbrhgaeqltshpqvdsr 

GCQKRH4QTPimU5WG?nmTLTLYLHKKK 



| 6186 | 569^ 



238 



i fc 5 0 < 7 ; ■< kj H~;« ;« i i ?K< r* : EgTH? j a >iv ; ^ fi Z 
KBaKVKNKVKNKADrEBVFHNSPTOQBKMPTSATDPD^SGSVIS
NtRNQMETTiHSQPHQKKHLCFBRSFSLlHLLPIHAVKPTSSQQI 

LBDCmVGLHlOGinJtfWniRXPySKSBIlfavCSVGC C SHLL3 8 
EFE^HKKRTQKKJIGPP&OJYIjKEHRRCCGRNARMSACLAI»ERV 
AASQGLPTBSLFVRAVLQDI lg D CTGlTKCTOBV O KlYSK CSSP 
LDYVHRSLKKIfiLDBSKIiPBKIlMIIYYEHyiCPHMNBIiRAFIC^LK 
VVlAPCXBTIJIil^RI^YIiKHQKDmfSALVKLFDPVKSPRCYA 
VIALKKQQ* FP1KQI IRCISL* DSAGCABKVSVG^GGPALRDAP 
PSGSHVGSRYD 



DAWGPETRLARIXNPDSFIEPRPGRLP 
AAPI>lERXPHVLVGVTGSVAAIiKLPLIiVSJCLLDI PGLSVAWTT 
EfiAKHFVS PQDrPVTLYSnADKWEHWKSRSDPVimDLRRHADL 
LLVAPLBAHTI£KVASGI(32NLLlt!VMRAHI)8S KPUiFCFAMtfT 
AKVKHPITAQQVDQLKAPPYVBIPC^KKLVCGPEGLGJVKREVG 
TIVDKVKEVLFQHSGFQQS *PGISVMGVPbYSEJ»VQAKSVKMDV 
GKIGGY PHU^GGPAI>S LPRGQACSRUfifTSGPGLS PFQPGEAA 
A 



KGFV^2AGPUfaSLQVSPQHKAPEMS0rCXSGGHPSA # GPRNAGH 
NIGVPICIRC»GIHRNU^7HISHVXSVKLDQWTQEQI(X>1QHKO 
NCKAJ*RI*YBAYLPBTPRRPQII>PAVBGF3RDXYBKKHYMDRSLD 
XKAFRKBKDDKHKROSEPVPBKKLB PVVPHKVKMPQKKBD PQLP 
RKSS PTCSTAPVMDLIOTnAPVACSIJUeirrSlITLElaJIiDIiIiASV 
PSPSSSGSRKWGSHPTAGSAGSVPSNXJnjFPBPGSKSEBIGKK 
QbSKDS XI^LYGSQTPQMPTQAMFWAPAQMAYPTAYPS FPGVTP 
PNSXMGSMMPPPVt34VAQPG7\SGMVAPWAMPAGYMGO<QASMMG 
VPEJGMMTTQQW^fMAGHAAMPQTVYGVQEAQQIjQWNLTQMTQQH 
^hWPYGAKGMMNYt^}SMSGGNEQAANQTI*3PQMWK 



U3 B PLGDLCEb I iX3DVQQLQHGBVHTOTOAQGSAAQSVAG3VQL 
TQLSUARQRPSCQC3SQl»XAZi>LQHMDrSRQPREQHVOPVAROVO 
RAQQAQLiAB(3VAVHLHAGnAVVAKVEIiIX2HVGGGXVFAANACDL 
VVQDHEG3VH7iARQRTGHAIiQRVIVaVRRVOPLEAb*RVPSGI*PR 

rvrafmilhijqitcioreofattyflbsuilsyhritspqvhrd 

APRKI*Rf JjRSLDXjSGHgLHMLPPGIjPRWVHVLKVKRNHLAAIjftR 
GALAGMAQLRELyLTSH&l»RSRALGP9ANVDLAHLQLLDIAGNQ 
LTBI PSGLPKS LBYLYLQlOnaSAVPANAFDSTPNIiKGI FLRPN 
KI^VG SVVDSAFRRLKHIjOVUDIEGH LSFGDISKDRGHLGK5KB 
BERBDSVBBBBTR 



ILV(3»VSFIiLSJ%ByVC»CSVVGSLNVNRafOTTGQCfiCRPGYU 
G^CBTCKBGFTIJfYTSGbCQPCDCS PHGALS I PCNSSGKCQCK 
VGVT GS ICDRCQDGYYGFSKHGCLPCQQCIRSAS CDALTGACLN 
OQPSNSKGNBCEBCXEGFYQSraAXKBCljBCPCSAVTSTGSCStK 
SSEI^PBCDQCKlXjYXGPNQTKCEKUyYNPDfllCRKOQCHGHVY 
PVKTPKICKPBSGBCattlClJOrnX3PHCBNCI»*CJyVHDIiBGNCI K 
KVILPTPEQSTILVSHftSLTTS V PTPVl ^gX PT P T lli QTlPSVS 
TSENSTSALADVSWTQFNI I JXTVI 1 IVVVI^LWGFVGAVYWYRK 
nmKLKAPPHTI ELKKIJttl SFS5YHDS I PNADVSGLLHDDGHK 
VAPNGQKTLTTPIHNYKA 




TRGCGNKMflGKKNVLSSIAVYABDSBPBSDGBAGI EAVGSAAEK 
RBGLVSHATGEDDPSRLGGDEDGYEBEBDHSSRQSEDDOSBTSK 
PgADDPKDNTEASKRDPQKLVASFSKRVRNMSPDEl KIPPBPPG 
RCSWHLQDKIQKLYERICIREQIDMNYllCStKKEPWJnPSIYBKliI 
OPCaiDHDGTHYPKDMTOPHBWSEDSYyBALAKAQKIBMDKLBK 
BBHB








akkbrtkibpviotkkbtttnatsttttxastavadaqkrks kw 

DSAI PVTTIAQPTIL7TTMLPAVVTVTTSASGSKTTVI SAVGT 
IVKKAKQ 


6193


3 


950 


TKKljWKMAGKKNVl^SIAVXAEDSKPKSDGKftGIKAVCJSAAEB 
KGgLVSDAYGB0DgSRU3GDgDGYS3g8DBBRQ^DDPS3TBK 
PRADDPKDNTEAKKHDPQKLVMPSBRVRKMSPDKTKTPPBPPG 
RCSHHTiQPKTQgLYBSICXKBOWDMHTX IQRKKKFRNPS X YHKLI 
OPCAIDEWnjJYPKDMFDPHSWSBDSTYEfiLAKAOKXBMDKLBK 
AKKKRTKIB PVTGTKEGTTTOA'fS ITT 1 1AS1AVADAQKRJCSKM 
DSAI 7VITIAQPTILTTlATLPAVVTVTrSASGSKTTVISAVC?r 
IVKKAKQ 


6194 


3 


950 


TRCOGNKMAGKJttJVl^lAV^AEDSBPeSDQKAGIEAVGSAAKB 
KGGLVSDAYGHDDFSRLGGDBDGYKBBEE®N3H09EDI3DS BTEK 
PKADDPKIttITSABKHDPCELWASFaBRVRWOPDSIKIPPEPPG 








RCSNHLQPKIOKLYEHKlIBCTWiyn IQBXKBFRNPSI Y6KLI 
QPCAI DKUJXNYPKIPlFDPHG^HDSYYBfiLAKAQKIBMDlCLKK 








dsaipvttiaqptilttiatlpavvtvttsasg^ktivisavgt 

IVKKAKQ 


6195 


736 




VANGlQSN>IPKJ^CinfCl)TYLTHDSPSVRKTHC3GRKHKENV^"~ 

YYQXBHKBQAQSLIDKrTAAPQQGKIPPTPPSAPPPAGRMIPPP 

pslpgpprpommpaphmgopppmpmwgppppgmmpvgpapgmrp 

PMGGHMPMMPGPPHMRPPARPMKVPTRPGHniPDR 


6196 


1512 


623 


KTG10taSAWfVH»IUniABaVISNLBARNiSPRL^PLiQBEDai“ 

QRLIjttGIiMVS ELKDHPLHHLQGVBKK1CI BQMVLDY I S KLLDLI C 

HIVBTNWKHNLHSHVLHPNSRGSAABPAVPHIMTRILBATNSL 

PLPIJ>PGFHTIflTII£VQCId>IiWIjaCIDSGVIiLLTBIAVIRl» 

MKDIiDNTSKNSKLKFSirVRLPPLIOQKICSLWDHPMSSNI ISR 

HHVTRIJjQNYKK0PRNSWIli)CS3PSVEPLPUWKIBIT*TDIKSS 

HQAbyPPBGHnNVDABFVBEAALKHXAMIdjGL 


6197 


3 


819 


ADPBOTE3AVMSKYTRPPffTSi*ETCKNVAI3ATRPKDLRJKRFGaYQ 

PIVDVTIPU)PTTRRPRG?ATVOPBDVRIlAEDAt*yHXiNRKKVC^ 

rqibiqfaqgdrktpgqhkskkrhpcspsdhrrsrspsqrrtrs 








srtprrnfgsrgrsrskslqkrsksigkbossspqkiqtosgtks 
RSHGRHSD 8IARSPCKSPXBY TOPBTHVQIWtHSHPRSHSRSRS 
YRHKN3W 


6198


111 


1912 


SBAAZ^PSFISPJU7L(AKLPAltBDGTI^HPDTU^(NYBGAR5R 
RKHHAADDS EGGALDMCCSHRL PGliPQP IVMEAIjDSAKGLQDSQ 
RRMPPPPPPSPPSDPAQKPPPRDAGSUSLTVRSSLCDPAASOFI* 
LAa?VLWFSGTCH3WSQNA7NI»VSSU.TIJJCQLBP , TAWIiDSGTW 
GVPSLLLVFl>SGGLVLVl , fLVWHIJ£TPPBPPTPI*PPKDRROSV 
SRQPSFTTCBtiSHBBKIBDDFliDlDPVPBTpVPDCVMDI KPHftDP 
TSIjTVKSMBIQKRROSHVSLTld3HCTPOCtfBEGFGYU4S PRESS 
ARBYLLSASRVTiQAKHiHKKTVLDPPIXQAEPFBIPriNFVDPKEY 
DIPGI/VRKRRYKTHiPNPBSKVCLTSPDPDDPLSSYINANyiRO 
TGGBKKVTIATQGPIVSTVADPWRMVWQKHTPirVKITNIRHMN 
KKCrTBYWPBBQVTVYIXjyBrrVQKVIHimJYRIiRLlSLKSGTEKR 

glkhywftswpdqktpdrappllhlvrbvrbaaqqbgphcapii 

VHC^AGIGRTGCPIATSICCQQISQBGVVDILKTTCQIiRQDRGG 

HXQHCBQYQFVHSVNSLYBKQLSHQSPE 



6199


144 


1211 


MABtENGHSSSSWKKQABDIKKIFKFKBTUsTGAPSEVVLAREKA 
T3KDFAVKCI PKKALEGKBSSIKNBIAVLRKIKHKNrVAlRDIY 
BS PNHLYLVI^LVSGGKLFnRrWBiOSFSTBKDASTLIRQVLZiAV 
YY1HRMGIVHRDLKPKN1LYYSQDKRSKIM1SDFGLSKMBGKGD 
VMSTAOGTPGYVT^PHVLAQKPYSKAVDCWS iGVXAyiliLCGYPP 



IRKM








FYDEND3KI*PEQlIiKAKYBFDSPYWDDISDSAKDPIRNI21EKDP 

NKRYTCEOftARHPHlAGDTALirKNIHBSVSAQlRKNPAKSKMRQ 

AFNAXAVVEHMRKLHLGSSLDSSN71SVSSSLSIASQKDCASGTP 

HAL* 


6200 


102 


96 


”£pKVPHSLRPRVin*HLCC3^PA^VMAHIiPKLAVFDId)YTI*HPP 

WVDTHVDPPPHKSSDOIVHDHHGQDVRLYPBVPBVLKBLQSLGV 

PGAAASRTSBIEOANQLLBLFDLPRYPVBfiElYPGSKXTHFHRL 

QQlCIGIPFSQMIFFDDERRNrVDVSKLGVTCIHIQNGNNLQTLS 

QGLBrFAKAQTCPLRSSLEBSpPBA 


6201 


2809 


2383 


GQTPRVPWKWKRStiELACKRRQTAGRKSKSPPKVPIVIQDDSLPA 

GPPPQIRILKRPTSNGWSSPNST6RPTLPVKSLAQREABYAEA 

JUCRIXGSASPBHEQRKPILDRPTRISQPBDSRQPNNVTRQPLGP 

DGSQGFBQRR 


6202 


2 


426 


1NADRAAVA3SLLSRFTRKMAPQKBRKPKRSTNRFHLDLTHPVB 
DG I FDS<3iFEOPLRBKVKVNGKTGNLGNVVHiraiFKNIITVVSK 
KQFSKRYlJGfLTKKYLKKNNMLWLHVVASDEKTYRLRYFQISQ 
DKDKSESED 


6203 


1 

1 


419 


2550 


RCPRPPATAGAAASRPDaSPPSG ISGSEAAAGAQAAAPA3QHPA 
TGPGAVQTBAHKQILGVI DlTOURHIiBKKKn KI .DDYQHEHNBCTR 
UfQDQLZ^VSKyQEVnDfLBFAKELQRSPNALSQDIQHTIKEXA 
RREQLMRHBAEQKRLKTVLBLOYVLDKLOTDSVSTDLKQGUiGV 
P ILSBBHLSLLDEPYKLVDPERDHSLRLNBQ YBHAS I HLWDLLE 
GKKKPVCGTTYKVLKETVERVFQSIJYPDSTHNHQNGLCB E3KAA 
SAPAVKDQVPKAEPBPABBVTEQSEVESTKYVNRQFMABTQPTS 
GBKEQVDEHTVBTVBWNStiQQQPQAASPSVPBPHSLTPVAQAD 
PLVRftQRVQDLMAQMQGPYNPiaDSHLDFKNQrLDPAlVaAQPM 
NPTOTMDMPQLVCPPVHSKsnLAQPNQVPVQPEATOVPLVSSTS 
EGYTASQFLYQPSHATEQRPQKBPIDQIQATISLNTDQTTAS3S 
LPAASQPQVFWOTSKPliHSSGINVNAAPFQSPlCrrVFKWNAPVP 
PVHBPBTLX(K2PQVQA5T2iQSFSSQPIQ}VBQTBLOQBQLQTVVG 
TVBGSPDQSHQVTQiKQOPPQQNTGPPRSNQPYYNSBRGVSIlGGS 
RGABGLMNGYRGPANGFRGGYDGYRPSPSNTPNSGYTQSQPaAP 
RDYSGYQHDGYQQNFKRGSGQSGPRGAPRGRGGPPRP23RGMPQM 
RTQOVN 




2933 



787 


CTHNLlSLLGGHALIHFNRFIJiLKIQKGHAHNIPCPAYDCFQXjV - ' 

PGDIIB^VVSKEMDKRYLQFDrKAFVKNNPAlK»CPTPGCDRAV 

MiTKQGSNTfiGSDTLSPPIJiRAPAVDCGKGHLFCWBCLGRAHKP 

CBOQTWKNNLOKITEMKPBBLVGVS KAYEnAANCLWLLTNSKPC 

ANCKSPlQKNEGCNHB^CAKCRYDFCWlCEjBBIfKKHSFVHHErVl 

YRCTRYBVIQHVBBQSKEMTVKABXKHKRFQELDRPMHYYTRFK 

NHEHSYOLBQRmCrAKBKKBQLSRALKBTSGQgPDTTPlBDAV 

HVIXKTRRILKCSYPYGFFLBPKSTKKBI FEXMQTDLEMVTEDL 

AQKVNRPYLRTPRHK1 1KAACLVQQHRQEPIjA3VARGVAPADSP 

EAPRB5FW3GTW)WBYLGPASPBEXAEPOYRRHHRQRRRGDVHS 

LLSNPPDPDKPSKSTLDIPBGGSSSRRPGTSWSSASMSVLHSS 

SliRDYTPASRSBBQDSIiQALSSLOEDDPNIIilAlQLSLQBSGLA 

LDBBTRDPLSNHASLGAIGTSLPSRLDSVPRNTDSPRAALSSSE 

LLSLGDSLHRLGAEN3PPSTDTLSSKPLSEARSDPCPSSSDPDS 

AGQDPNINDHL1jGNIMAWFHDMNPQSIAI»IPPATTBISABSQLP 

CIKDGfiBGVKDVBLVLPBD3KPEnASV3BGRGTQlEENPLHB»l 

PGGGKQHPQAM 


6205 


1 


1200 


RAkE^£Q4ALBVGDHEDGQLSDSDSDKi^APSbRPLQl»PKVLGGD “ 
SAMRAFWTAIACAPVSHYHAVESVDSSBES FSDSDDDSCLWKR 

krqkcfhpppkpbpfqfgqssqrppvaggkkinnihgavlqbqn 

0DAVATKLGII/5MEGTIDRSRQSETYNYLLAKKLRKKSQBHTKD 

LDKBLDHYMRQGKKMt3SKBBBNGQGHIiKRKRPVKDRLGNRPBMH 























WO 01/53312 



PCT/US00/34263



SEQ ID NO:

Predicted beginning nucleotide location corresponding to first
amino acid residue of amino acid sequence

Predicted end nucleotide location corresponding to first
amino acid residue of amino acid sequence

Amino acid segment containing signal peptide
(A=Alanine, C=Cysteine, D=Aspartic Acid, E=Glutamic Acid,
F=Phenylalanine, G=Glycine, H=Histidine, I=Isoleucine, K=Lysine,
L=Leucine, M=Methionine, N=Asparagine, P=Proline, Q=Glutamine,
R=Arginine, S=Serine, T=Threonine, V=Valine, W=Tryptophan,
Y=Tyrosine, X=Unknown, *=Stop Codon,
/=possible nucleotide deletion,
\=possible nucleotide insertion)








YKGHYKITAKDSOKiCVADBISPRIiOBPKKDUARVVRIIGMKKA 
I3LLMETASVEQKGGLPIf03G3R2^PGGVPLNLIJQ?ri?SISEE 
QIEDIFYIHNQKBYENKKAAKKRHTQVLGKKMK0AIKSI2JPQBD 
DDTSRBTFASnTNKAIjASLDBSQBGHAEAKLEAEBM EVDHSHD 
LDIF 


1 


10 


1442 


II3ERJtBIt3CIiHLVCIRCSQ>VVEMGSViiGIiCSMAS>fIPClCGS 
APOXCBCCPSQDmVTRLIYAbFI>LVGVCVACVHl»IPGKEB0 
LJJKI PGPCBNKKGVVPCNIbVGYKAVYRIjCFGUJiPYLIiLSIJH 
IKVKSSSDPRAAVHHGFWPFKKAAAiAIIIGAFPIPBGTPTTVW 
PYVGHAGAFCFXLIQLVLLIDPAHSKHKSWKKMHEGHSRCVfYA 
ALLSATALNYLLSLVAIVLPPVYYTHPASCSBNKAPISVNMIiIiC 
VGASVMSILPKJQESQPRSGliQSSVITVYTMYLTHaAICrNKPK 
TOCNPSLLSIIGYHTTSTVPKEGQSVQWHHAQGIIGLHiFLIiCV 
PYSSZRTSNHSQVMlELIIiTSDBSTLIBDGGARSDGSIiZDGDIJVH 
HAVCHBRDGVTYSYSFPHFMLFIASIiYI WMTI/IHirniYBPSHEM 
KSQWTAVWVXISSSa IGrVliYVT?TLVAPLVI.TNRDPI> 


6207 


2924 


1471 


~TVMAKAATPGTTA5T*SGAGAAAA3AAAAS PTPIPTVTAPSI/JAGl 
1 GGGGGSDGSGGGKTKQVTC3lYFMHGVCKBGI3NCHYfiHDI»SDSPY 
! SVVCKYPQKGYtar«3DHCKYEHSXPLKQEKA3MBLTTKSSIAA 
SSSLSSrVOPLVEMNTGBAESRHSNFATVGAGSKDWVHAIBPVP 

VGBCR¥GBHCVYIiSS2)tSODI4C6bQVLUPNDAAQRSQH I KSCIKA 
HBKDMBLSFAVQRSH3JMVCX3ICPIEVVYKKANPSERRFGILSNCN 
HTTCLKCIRKHRSAKQPBSKI IKSCPBCRITSNFV1PSBYWVKE 
KKBKQKLILKYKBAHSNKACRYFDBGRGSCPFGGNCPYKHAYPD 
GRRBSPQRQKVGTSSRYRAQRRNHPWKLIEERENSNPFDNDEBB 
VVTFSIJGBMIiLMIiLAAGGDDBLTDS EDBWDLFHDBLEDFYDLDI* 


6208


2924 


1471 



TVMABAATPGTTATTSGAOAAAATAAAASPTPIPTVXAPSU3AG 
CK5GGGS1X3SGGGHTI0QVTCRYFMHGVCKBGDNCRYSHDLSDS P Y 
SVVCKYPQRGYCIYGDRCRYKHSKPLKQBBATATKLTTKSSLAA 
SSSLSSXVGPLYKMHTGKABSRHSNPATVCAGSKDWVHAXEFVP 
GQPY GGRTAPSCTBAPbQGSVTKBBSERBQflAVBTKKQIiCPYAA 
VGECRYGKHCVYLHGDS CDHCGI^VI^PMimAQRSQHI KS CIEA 
HBKDHHLSPAVQRSKWVOGTICHBVVYBKMrPflBRRPOILSNQ? 
HTYCLKCIRKSTOSAKQFE8KIIKSCPBCRITSNFVIPSKYKVEK 
KEBKQKLILHYKBRMSHTCACRYFDEGRGSCPFGGNCFYKHAYPD 
GimBBPQRQXVCTSSBYRAQRRHHFfiKLIBRRESSNPFDNDESB 
WTFBLMtLIJgJJWGGDDgLTOSBDBHDLPHDgLBDFYDUH* 


1 


TtSb 


829 


ERLCKPCMQSKIYSYMSPiiKCSGtiRFPlJQKKHSVTHHHVKCQGK 

PLAGIYRKRRBKRNfiaiAVRSAHKSBBQKIKDARKGPLVPPPNQ 

RSRAABPPRTPPRSCDSTHAAZARQAIiKEPIKGKQAPRHKAQGK 

TQQNRKLTDFYPVRRS6RXSKABLQ3KKRKRIDKLIBSGXBEGM 

KIDLIDGKGRGVIATK0FSR«H)PVVBYHI2JLIBITnAKKRKALY 

AQDPSTGCYMYYFQYLSKTYCVDA'rRBTHELGRLINHSKOGNCQ 

TKLHDIDGVPHLIIilASHDIAAGHBbbYIWGDRSKASIKAHPWL 

KH 


6210 


3761 


387 


IFGMSlOJlMVLI^DSGSADFRiUil'VMLSPPTITWliIiLSACFVT 

sslggtdkelrlvdgehkcsgrvevkvqhebgtvcnngwsmkav 

SVICHQIiOCPTAIKAPGWANSSAGSGRIVWDHVSCRtaffiSAIjHD 

CKHDGWaKHSNCTHQQDAGVTCSDGSTSIiBMRLTRGGNMCSGRIB 

ikfqgrkgtvcddnfnidhasvichqi^cgsavs fsgssnfgeg 
SGPIWFDDLICNGNKSALWNCKHQGWGKHNCDHABDAGVICSKG 
ADL^I^LVDGTVTECSGRLRVR FQGEWGT I CDDGWBSYDAAVACK 
QUK:pTAVTAIGRVNASKGFG}OWIJ>SVSCQGHBPAVHQCKHHB 
t«aCHYC3IHHBDAflVTCSDGSDIjBIdlLRGGG3RCAGTVBVBIGRL 
LGKVCDRGWt3IiKEADVVCRQLGCGSAI»KYS YQVY 3 KIQATNTWL 

FLSSQfGfiFKreLWDCKLIWQWGGLTCDHYBKAKITCSAHRBPRLV - 

GGDXPC5GRVKVKHGDTKGS 1 a^D^SI^AASVI^HELQOlTVV 

S ILGGAHFGBQJC^jrWAEHPQCEEHRSHLSLCPVAP KPBGTCSH 

SRDVGWCSHYTKIRLVNGKTPCECTVELKTLGA5?GSL£3JSHWD 

IEXnHVXX3QQLKCGVALSTPGGARPGKGHGQIHSRMFHCTGTEQ 

HMGDCPVTALGASLCP8EQV7l3VlCS(afQSQTIjSSCSS3SIiGPT 

RPTXPBBSAVACIKSGQIiRLVNGGGRCAGHVKrmEGSWGTICD 

DSWOLSIIAHWCROLGOGKAINATGSAHFGBGTGP IWLDEHKCH 

GKKSRXWQCHSHGWGQQN CRHKKDAGV I CS E PMS LRLTSEASRB 

ACAGRLBVTYNGA5<nVGKSSl^KTTVGVVCRQLGCAI)KnKINP 

ASliDKAMSIPMWVDNVQCPXGPDTLWQCPSSPWEKRLASPSBBT 

HrrCINKIRLQEGPTrsCSGRVBIHRGGSWGTVCDDS¥DLDnAQV 

VOQQLGOGPALKAFKBAEFGQGTGPIWIdfBVKCKGHBSSXiHDCP 

ARRHGHSEOaHKKDAAVNCrDISVQKXPQKATTtmSSROSSFIA 

vgilgwllaipvalpfltkkrrqrqriavssroklvhqiqyr 

EMHSCLHADDH)LMNSSGCatSBPH 

I FGHSKIiRKVLLKDSGSADFRHH8Vtfl^P^XTWIJJ>SACFVT 
SSLGGTDKBLRLVDGB3STKCS<3tVHVKVQEBHGTVCNHG¥affiAV 
SVXQ9QLGCPTAXKAPGHAMSSAGSGRXHNQnV3CRt9fBSAUn) 
C! K HDG W GKHSHCIB00nAGWTC3iy>3NXfiHRLTOGGWCSGRXH 
IgPQGregn V a roN PMIDHASVXC R OLSCGSAVSPSGSSHPQBQ 
SGPIHPDDLICNGNBSALMNCKHQGWGKHNCDHAKDftGVI CSKG 
ADLSIiRLVDGVTECSGRLEVRFQGEWGTICDBGyroSYnAAVACK 
QLGCPTAVtAIGflVMASKGFXaHIMLDSVSCQGHEPAVWQCKHHH 
WGKHYt>3HNRDAflVTCSDGSni.Rl;T?I,RGGGSRCAGTVEVEIQRL 
I/JXVCDRGWQLKEADVVCRQl^CXlSAIJCTSYQfVYSKIOA'PNTWL 
FLSSCNGNKTSIiWDCKHWQNGGI,TCDHYEEAKITC3AHRBPRIjV 
GGDIPCSGRVKVKHGDTWGSICI^DFSLRAASVWaiBIiQOCnVV 
SXLOGAHPQESNQQXWAKBFQCEGHBSHLSIiCFVAPRI'EGTCSH 
SRDVGWCSRYTSIRLVNGKXPCSGaVKLKXLGAWGSIfCNSHWD 
IHElAHVljCQQLKEGVALSTPGGARPGKiaJGOIWRHMPHCrGTEQ 
HMGDCPVTALGASLCPSRQVASVICSQJOSOTLSSCRSSSIiGPT 

DSWDLSDAHWCRQLGCGKAXN7VTGSAHFGEGTGPIKIiDBMXCN 

GKBSRXWQCHSHGWGQQNCRHKBXiAGVICSBFHSLRLTSKASRB 

AJCAGRLBVFYKOAJJOTVGKSSMKBTTVOVVCRQLGCADKGKIltP 

ASUDKAMSXPMWVDNVQCPKGPDXLWQCPSSPHEKRIiASPSBST 

H l TCD HKll lLQBGPTSCSGRVBIWKGGST'KnVCTOSHinjIIDAQV 

VCQQLGCGPALBAFKEtASFGQGTGPIWLSBVRCKGNBSSIiVroCP 

AEHHGHSBCGHKBDAAVNCTDISVQKTPQKATltSlLSSftQSSPlA 

VGIIiGVVLLAIFVAIiPPLTKKRRQRQRIAVSSRGEHLVHQIQYR 

EMNSCUIADDLDLMNSSGGHSEPH 

LKWKLKPGGAVWGTGRGAGTGAPHBCCXQTHPGPPSSLRRAPRR 
RBLPFPACHEXGLGA8A3SGP PPAPAARBSRSRAMBEBASSPGL 
GCSKPHLBKLTLGXTRXLBSSPGVTHVTIIEKPPAERHM1SSWK 
QKlWCVPffEIWKNFYXJlIKGPHMrWSVKIJnBHI IPUGSMAIHSI 
SKLTQLTQSSMYSLPN7^PTLADLKIKrTH8ASDDQPBKPHPDSRS 
VI PBIJ5SCNG5GIVnLVYKSGKPArjVBDTEI WFLDRALYHHPLT 
IWPXAYYRIiLI'IHLGLPQWQYAFTSYGIS PQAKQKVSMYKPITY 

htnlltbbtdspvhkldpskvfkskhkivipkkkopvqpaggqk 

GPSGPSGPSTSSTSKSSSGSGNPTRK 

LKMBLHP^AVWGTCTGAGTGAPRSCCCOTNPGPPSSLRRAFHR 
RKLPFPACHK IGLGAEAGSGP PPAPAARE5RSRAMSBEASSPGL 
GCSKPHLEKLTUGITRILBSSPGVTBVTI I BKPPABRHHISSWB 
QKNNCVHPEDVKHFYXiOKGFHHrHSVKLDBKIXPLGSMAXKS I 
SKLTQLTQSSMYSLPHAPTIAPLKDDTHEASDDQPRKPHPDSRS 
VI FELDS G«3SGKVCLVYKSaKPAXABDTB IHKXxDRALYHHFLT
IJITTRyXELUTHIinJQVKJfXAPTSTroiSPQRKQRVSHYKPITY 
NTOLLTKBTT^FVNKLDPSKVFKSKNIOVIPnKGPVQPAGGQK 
GPSGPSGPSTSSTSK3SS0SGSPTRK 



PVGWV 






VIXTTSAKRTHIiFFFLSKKAASSRAQfVLJ^LYRAWIjRBSICRFSAXN 
YRTYAVHRI ElDAFRQffiHVKDPVB I QTLVNKAKRDUJVXRRQVH 
IGQLY3TDKLI IBNKEHPRT 



PRGSGSAftBTHPBIRVTPLGAGanVGRSCILVSXAGKWV 
MIiPCt3<H>g FH lJPR R FFDFSYrTQyGRr.TD?IJKVIISHFHLnH 
□QALPYFSBKVGYDGPIYMTHPTQAICPILLBDYRKIAVDKRCS 
ARPFT6QMIKDCKKKVVAVHU57rWQVDDKtiRI XAYYAGHVL(3\ 
AMFQI KVGSBSVVYTGDY»frP13RHIjGAAWIDKC3PNI*I*ITE3T 
Y2\TTI RP S iCRCBSai» I»gKVHBTVBRGGKVI»IPVEMJgSA0BLC 
ILlBTFHBRMHliKVPIYPSTGI/reKANHYYKIiFIPWTWQKIRKT 
FVQHNMFBFKHIKAFIPlAFADNIKn^JVVPaTPGMLHRGOSLQIP 
RKHTUGNBKNMVIMPGYCVOGTVGHKILSGQRKIiKMKGRQVLEVK 
MQVBYWSFSAHW1AKGIMQ1*VGQABPKSVIjI.VH6KAKHMBFLKQ 
KIBQgLRVNCYWPANUKTVTLPTSPSIPVG ISL G M t K RBlUAOGL 
LPEAKKPRLLHGTLIMKDSNFRLVSSEO^KBI/JIABHQLRFTC 
RVHLHDTRKBQ8TALHVYEHL»KSVI>KDHCVQHI*PD03VTVSSVI* 
LQAAAPS KDPGTKVLLVSVTYQDHBLGS PliTSLIiKKOLPQAPS 



QTTRPBPRKSALRQSRSKMAWGVS SVSRIJUQRS RPQXGRPMSS 
GAHGBEGSARMHKTLTFFVMiPGVAVSMLNVYIiXSHEBGHHERPE 
FIAYRHLRIRTKPFPWGIX^HTLFHNPHVNPLPTGY EDB 



TRVGRfiBSQIiStRVKPPPGRPQPDSCJRRRRRRGEEGHDPXEP EQ 
IJdCLFIGGI^FBTTDDSIjREHFEKKGTLTDCVVF^ 

PGFVTYS CVBBVDAAMCRRPHKVDGRWBPKRAVSRBDSVKPGA 
HLTVKKI PVGGlKKOTEBYNliRDYFBKYGKIBTIHVMEDRQSGK 
KRGFAFVTFDDHDTVDKXVVQKYirriNGHKCBVKKAIjSKQBMQS 






GGSRGSYQOGPGGYKOFGGDOQHYGGGPGYSSHGGYGGOOPGYG 

HQGOGYGOGGGYDGYNEGGNFGGGMYGGGGNYNDFGNYSGQQQS 

NYGPMKGGSFGGRS5GSPYGGGY3SGGGSGOYGSRRF 



SCKRRGFIMAIJPIJCRFLYKKIiPSVBGLHAIVVSIJRIXJVFVTKVR 
NIJNftPBHALRPCFLSTRUiATDQGSKLGCLSKNKSIICYYHTYQV 
VQPNRLPLWSPIAgSSAWTGLI V3 LH KKIAPIiFKKLRqWEVS 



ABPGKGAGAIjniCJUiAKABZ^ftSAGGKDCHS PAPKADRFHQRPFI* 
IGVSGGTRSGKSTVCBOMEUJOQ^BVEQRQRKVVTLSOTIFYK 
VLTABQKAKArJfGQYNFnHPnAFDNDI^SHRTLKNXVKGKTVEVP 
TYDFVTHSRLPBTTWYPADWltFEG HiVFYSQBlRDMFHLaiiP 
VDTDS DVRI^RHVLRDVRRGRDLBQII»TQYTTFVKPAF5BPCLP 
7KKWJVI I PRGVDSMVAIHLXVOHIQDIUKH>ICKWHROGSNQ 
RSYKRTFSKPGPHPt^CjTSGKRSHIjBSSSRPH 



ECFJISIiBpJSCriKKAljADAKALVERIiRDHDDAflBSIiIBQTIAIiM 

KRVEnMKQYQKRIQKLNKVARHRPRSTLVMGIQQSHRQIRKLQQ 

enkklrtslhbkqsai*ei>imskyrkqmfrllmaskkddpgiimk 

I^EQH3KlDMVHRinCSEGFFr»nASRHXLEAPQHGLKRRHLBANQ 
HVH 



RHITOLNPVSDGLKLRRKYNGILHCLTT IWKLDGLRGLYQGVT P 
HIWGAGI»StJGLYFVFYNAI KSYKTEGRAERLKATBTLVSAARAG 
AHTIiCTraPIjWVTKTRLMIiQYDAVVHSPHRQYKE3^PT>TLVlCIYK 

ybgvrglykbpvpglpgtshgwuqfmaybllklkykqhinrlpb 
aqlstvbyisvaalskieavaatypyqwrarioxjhmfybgvz 
DVITKTWHKBGVGGFYKGIAPIIURVTPACCITPWYBHVSHPTi 
TJltiBKKkK 



KAaHLRAIiLI^^RHLRPIjbRAPAIjAAVPGGKPIIjCPHHT'TAQLG 
PRRNPASrSLQftGHbFSTQTAKDKKEPliBSIISSTBSVQqSTSra 
BFOABTKXIiIiDXVARSLySKKKVPXRBLISKASnALEKLRHSlV 
SIX^JRLPEMEIHLQTNAKKG'riTIQOTGIGlCTQHELVSNLGTIA 
RSGSKAPIinALQNQAKASSKTIGQPGVGPYSAFKVAIlRVEVYSR 
SAAPGSLGYQtfLSBGSGVFBIABASGVRTGTKII IHLKSDCKRP 
ssearvedvvtkysnpvsfplylngrrpihtlc^vimkmdpkdvre 
TOHEBPYRYVAQAKDKPR Y TLHYOTDAPI/TTRSI FTVPEMKPSH 
FDVSRELGSSVALYSRKVLIQTKAXDILPKSTLRFIRGWDSKDI 
PU7LSREI«LQBSAI>IRKUUIVLQQRliIKFFID0SKKnABKYAKF 
FKDYULFMR3GIVTATSQKVKEDIAKLLRYESSALPSGQLTS LS 
BYASRMRAGTRNIYYLCAP1IHHLABHSPTYKAMKKKDTKVLPCP 
BQFDKLTLLHLRBFDKXKLI SVSTDIWDHYKBBKFKDRSPAAB 
a^ BHBTBg LK ftHMRNVLGS BVTOVKVTM IJ>lHPAMVlVL5>P 
AARHFIiRnQQLAKTQBBBAQliLQPTUBINPTtBALIKKLNQpLRAS 
EPGIAQIiLVDO IYBMAMIAAGLV UDPRAMVSRUJKLIiVKALKRH 







LLSCHRLAICKI»PF3VBSRKTVMGPQGARHQAFLA3PGDVTVDPT 
QKENRLLS PAQRALYRKVrLHNYSHLVSLGILMSKPBLIRELBQ 
GBVPKGEKRRRRPGPCAGIYAEHVL3PKHLGLAHQRQQQLQPSD 
QSPQSDrCABGQBKBKSTKPMAPSSPPLRHAVSSRRRNSWBIES 
SQGQRENPTBIDKVLKGIEHSRWGAFXCAERGQDFSRKMHVI IH 
KKAHSRQKLFTCRECHQQPRDBqM.T»t^Cai , fglt3BK3YVCSVCG 
HOPS LKANLLRHQRTHSGBKPFLCXVCGHGy TS KSYLTVHERTH 
TGBKPYBOQKCGHRPNDKSSYNKHIjXABSGRKP PVCKBCGRfJYT 
NKSYFVVHKRIHSGB[D?YRCX2ECGSGPSHKBHLIT1KJRTHSGEK 
PPACRQCKOS FSVKSSLLHHQRTHSGBKPPVCKDCERSPSQICST 
LVTHQRTOSGKICPPVCREOGQGPIQKSTLVKHQiraSBBKPPVC 
KDCGRGPIQKSTPTLHQRTHSKBKPYGCfiBCGRRFHDKSSYSKH 
LHAHLGEKRPTCSDCOBflPTLRPNLTXBQBIHSGroPFKCKQCX 
KS PS UU\HIXtHQHTHSGHRPFNCKDCC3?GyXLKS TLLFHQKTH 
SGBKPFICSBCGQGFIWKSNIiVKHQLAHSGKQPFVCKKOGRGPN 
WKfflrLLTHQRTHSGKKPFVaWCGQGFSBKRSLTRHHWRIHSKB 
KPPVCQBCKRGYTSKSDLTVHKRIHTGBRPYB0QBCX3«CFSNKS 
yySKHLKRSLSBKRFCTGSVGBASS 



TKVS KIjLGGS QRI^FLPLWRRIjCROGLGPRVS PMAGPHVEVDGS 
MHGGGQSLRVSTCIiSNLLSLPWRAQRIRAGRSYA 



M3A3SLL^JRPKGQC3flKVQr«3SVHQKDGLNDDDPB?YI>SPQARP 
KNAYTAHSDS YLPSYYS PS IGFSYSLGKAAWSTGODXAMPTLTS 
YGQl*SW Gg P H FLPDAMFG0PGflLQSTPFLGtgg?FM?VPSGXDP5 
AWCRWSSQGQSTQSSGYSSHYAYAPSSIiGGAMIDG^APANBTL 
NKAPGMNTlDQGMAAXiKLGSTBVASNVPEn/VGSAVGSGS ITBNI 
VASNSbPPATlAPPKPASHADIASKPAKQQPKLKTKNGIAGSSL 
PPPPIKHNMDIGTWDNKGPVTVKAPSOALVQNIGQPTQGSPQPVG 
QQANNSPPVAQASVGQQTQPbPPPPPQPAQLSVQQQAAQPTRWV 
APHNRGSGFGHNGVDCBiJGrVGOSQAGSG STP5BPHP VLBKLRS IN 
nywpkdfdnnlkegrvpi IKSYSHDDIHRS IKYNIWCSTBHGHK 
rldaayrsmbgkg PVYLLPSVNGSGRFCGVABMKSAVDYNTCAG 
VWSQDKWKGRPDVRWIFVKDVPNSQLRHIBIiEHNEHICPVTOSRD 

tqevplbkakowlki iasykhtts ipddpshyekrq 


6228 


47 


1970 


grrcrrrgavmklaosarklgcuavbekgvpvaaeapkstlrrl 
CDGQGADIHAYIIX lHVHSQinVKKIRGHLLS?YGHQDSPQVRRKL 
BLBAAVTRLRABIQEIsDQSliBljMBRDTBA^TAHEQAfiQHTQXyT 
QRRALLL&AQAGANRSQQB'n^PMQHIX^liRBLQDHHRIQUCV 
DVTFGSLTSAALGLBPWLRDVRZACTIAAOPLC^LliLPQAKBQ 
SLPTPHDDHFG^SYQQWLSSVBTLLTHHPPCIHVLAALBHIjAABR 
KABIRSLC5GDGLGDTBISRPQAPDQSnSS0rU>SflVHLIQKGH 
RTVGVWSQRSTXJJgaQybTQRLQGLVBEVBRHVXjGSSHRQVL 
XliGLRHCXIIiinrKLKAIiHDQSQgXiQDAAGHRQLIiLRgLQAKQQRI 
LHWRQLVK gTQSQVR 1 .1 >T KHKftAS FCTRLCESPCHV1ALVQHKVV 
PTPBAVAPQSRBLLRCLBBBVRHLPHIIjLGTIjLBHRPGEUCPLP 
TVLPSIHQLHPASPRGSSFIALSHKDGLPPCKASBLIiLPAAASXj 
RQDLL lXQDQRSLHCHDLLBMKTSlf PGLPTQBLLQIQASQBSQ 
QBSOTiGOMiKRIiBKLLKOAbERIPKLQGIVCTWHBQPGQAALSB 
ELCQGLSLPQ>fRLHWVOAQGAIjQKLCS 


6229 


1571 


560 


GPSLLGTRGTPHPAKTXjQIFFI.IIGRRt*rofiMAAVDDLQFKHPG 
HAATSLTAHPnATTVHIBDPGETPKHQPGSPRGSGRBBDDKLLG 
NDDSDKTKLLAGQKKSS P PVfrPBYYQTFFDVDTYQVPDR IKGSL 
LPIPGMWPVHl*YXHSNPDLYGPPriCATLVFAIAI8GKLSNFI*3 
HLGKKTYHYVPBFRKVSIAATIIYAYA^VFLAUiGFU«WKN9R 
VMNIVSYBPLBIVCVYGY3LPITIPTAILH1IPHKAVEWILVMI 
ALGI SOSZiLAMTFWPAVRBDNRRVAliATIVTIVLIiHMLI>SVGCI» 
AYFFDAPEMDHLPTTTATPHQTVAAAK5S 


6230 


1723 


600 


SKMSGRSGKKKKSKLSR5ARAGVI PPVGRLMRYLKKGTFKYRIS 
VGAP VYMAAVI SYLAAE I LKLAGKAARDNXKAR ZAP RHI LLAVA 
NDBKUlQLLKGVTIASGGVIJPRlHPEIiLAKKRGTHGKSRTILSP 
PPEfCRGRKATSGKKGGKKSKAAKPRTSKKSKPKDSDKBGTSNST 
SBDGPGDGFTI LSSKSLVLGQKL3LTQSDISHIGSMRVBGIVHP 
TTABIDIiKHDIGKAItBKAGGKBFIiBTVKBZaRKSQGPliEVABAAV 
SQSSQLAAKPVIHCHIPQHGSDKCKKQLKBTI KNCLSAABDKKL 
KSVAFPPFPSGWrcFPXQTAAOVTLKAISAHFDDSSASSLKNVY 
FLLFDSRSIGIYVQBMAKIzDAK 


6231 


149 


870


1*1 F^STMDHSLRNVlWSWlFLlAL^TAYGGljQSbQSSLYSBBd 
LGVTAI*STLYGGMLLSSMFliPPI*I*X BRDGCKGTI ILSMCGYVAP 
SVGNFFASWYTLIPTSXIaZiGliGAAPLHSAOCTYIiTITGHTHAHK 
AGKSOKDMVHQYFGIFPLIFQSSGVKGHIiXSSLVFGQTPSQBTL 
PEBQLTSCGASIXIljMATTTTN3TQRPSQQI*VTTIjIiGrYTGSGVTj 
AVLMIAAFLQPIRDVQRESE . 


6232 


3679 


1476 


PVAGTT>1AGFWVGTAPLVAAGRRGR)# PPQQI24LSAALRTLKHVL ' 

YYSBDCIMVSRHbGSVGYDPHBRTFDKILVAMRGBIACRVXRTC 

RKKGI KTVA1HSDVDA8SVHVXMADHAVCVGPAPT6KSYLNHDA 

ZMBAlKKTttAQAVHPGYGFLSBHKBFARCZAAHDWFIGPDTHA 

IQAKODKIBSKLLAKKAKVNTIPGFDGVVKnAKKAVRIAHBIGY 

PVMIKASAQGGGKG34RIAWDDBBTRDGFRLSSQKAASSFGDDRL 

LIBKP IDNPEHIBIQVIXtDKHGNAIiWIJtKRKCS IQRRNQKWEE 

APSIFUJAKTRHAWJKOAVALABAVKYSEAGTVBFLVDSKKNFY 

FI*BMOTRLQVKHPVTKC3TG1*DLVQBMIKVAKGYPLRHKQADIR 

I HGHAVECRVYAKDP YKS FGLPS ICTI>SQYQBPLHLPGVRVDSG 

IQPGSDIS1YYDPMI£RZ*XTYGSDRTBALKRMADAL0HYVIRGV 

THHIALLRKVI IN3RPVKGDISTKFI*SDVYPDGFKGHMIjTK5 EK 

HQIiLAIASSIiFVAFQIAAQHFQEHSRMPVIKPDIAITHBZjSVKIK 

DKVHTVVASNHGSVFSVBVDGSKUArrSTWiXiASPIiLSVSVrGT 

QRTVQCLSRBAGt23MSIQFljGTVYKVNILTRLAABLNKFMX,BKV 

TJXrrSSVXasPKPGVWAVS vtcpgdavaegqricvi ramkmqns 

PfTAGKTGTVH5VHCQAGDTVGE(3)LI*VELB 


6233 


1 


3654 


hstrbhlnaghfnppseghlvrstgpggsfakhmvaqcvspkgp 
6234


■ 


404 


6235 


1 


£ii ' 1 


6236 

1 


1 


703 


m 


312 


720 


6238


? 


1^6 
LACSRTYlTGATBVPYl/jCI^ICLiPKKTBOrRIJjSQIYAAVXEAV 
LAGlACrAKTBSLTKAKKVABQTLGSGUJSFKLIPFKAALHSKM 
TFHXHAVHNQGRXVPLDSKDSLSFVKT7104AVYDI PDLLQGfGC 
UJSVVFSKSFLTSQILVKEIDGTVTTBTBSVVLTRAVPttFCSSCL 
VECNEVKLSKKTH0AVRGDESFLC3TYLTGGEGAYLYBOTLQSWP 
KEGNVHPF3 BGLLPSHCRHGSI I ISHDHJWB I SFYDGDST3TVA 
ALLXDPKSSLLPHLPVHFB33SHFLHIALFPKSKI YQAFYS BVK 
3LWWQQDH3G1SL K V1 Q BD G L3VBQKHLHSSA0KLFSALSQPAG 
KKEtSSLKIJiSAKIiPEgjDWFLQBPAXSSlSQBPVMRTHLPVlibQQ 

ABINXTHRIBSDKVIISIVTGLPGCHASEIjCRFLVTLHKBCGRW 
MVYEQTMBSSKCFHAAHFQRYLSSALKAQQNRSABQSAyrRKKT 
KLLVVLQGYTpyiDVVQfllXyX HP DS W VKASFTIGAITACVBPMS 
CYHBHIU?1 *FPKCLDQCBQGLVSHVVPTSBTTBQHHPLX»VQJjQSL 
mAANPAAAFlLAKNGIVTRNKDrBLILeENSPSSPBMLRSRYL 
HYPGHYBGKLNAGSVYPLMVQICVlfPGRPLBCCRFVAItCKAlQS 
SIKPSPPSGHIYHILGKVKPSDSBRTMKVCrarLRNSLSIMPVL 
KGPTPPPDSKSVSQDSSGQQBCYLVFIGCSLKKDSIKDWUIQSA 
KQKPORKALKTMCLTQaBIRS IHVKRHLKPLPAGYFYMG?TQPV 
HFFtaDKTDFHPLMDQFHNDYVEBAHRSlKJnnSQELBQQBYHDLy 
BLKP 

PRVRKDMDffitSPGHKGSLVYAOIKSIVK33LGMV£3SRHKWSGI» 
DKQSDIQNLNBERI LALQLCGWI KKDTDVDVGPPLN3LVQBGKH 
BRAAAVALPHIJ)IRRAIQrLNEGA33BKGDIJ?LNVVAMALSGYT 
DK KHS3 jWRBI4CSTLR1iQLNNPY 1*CVMFAFLTS BTX5S Y DG VLYKN 
KVAVRDRVAFACKFLSDTQI^IRY IBKLTOBMKEAGNIiKG I LLTO 
LTKIXaVDLMKSYVDRTGDVQTASYCWflGSPUJVIJKDERVQYW 1 
KNYHNIiLOAll3lPWHKRABFDIHRSKU)PSSKPIAQVFVSCNPGG 
KS ISY5CSAVPHQGRGPSQYGVSGSPTKSKVT8CPGCRXPLPRC 
ALCLINMGTPVBSCPGGTKSDEKVDLSKDiaCLftQFNNWPTWCHN 
CRHGGHAGHKLSIfPHDHABCPV9ACTCKCMQLI7rrGNljVPAETV 

QP 

EKRDHRLPSWPRAALKVPGRGGRVGTTPELAAOGlMATRNPPPQ 

DYBSDDDSYBVLDLTBYARRHQWiniRVFGHSSGPMVBKYSVATQ 

IVMGGVTGSfCAGFLFQKVGKLAATAVGGGPLLLQIASaSGYVCI 

DWKKVEKDVJ3KAKRQIKXRANKAAPBIHNLIREATHPIK3QNXVI 

SSGFVGGFLLGIAS 

WDQNKGAAAa3GLTLP3I^SARF3AGPPTQRSRPTMSMMBKHLF 
NIJOrAAKKLSRSAKKCDKBBKABKAiaKKAIQKraafflVARtHAB 
KAIRQJQSQAVHFLRH3ARVT1AVAAH.VOIAVTPCKVT KSMAGWK 
| SMDATUCHQlLSKZSAUIDKPBHQFETIiDVQfTQQMEmMSSTTr 
I LTTTCPQVDM1AQBMADBAGX*DI»NMBI*PQGQTGSVGTS7ASABQ 
DKLSQRLARLRDQV 




CyTONLBNSTtfCAAXKDLQVRACPFLPVKRBBKITTVLQPCDLP 

yqtpqkgtdpovtdmsvksltlkvbpvuktwititsalyttkb 

TIPBBTASSTAHLWEKKDTKTLKMWJLHESHBTBKXAPTrELVP 
KGBH IKMHXDS IPI VLKAGIGHRTVPMLLAKSR PSGEG KNWS SL 
INI*HOQ1£LEVBYYNBMFGVWKPLI»BPLBIDOTEDFRPHNLGIK 
HKKEAKMAIVBSDPRBBNYKVPBYETVISFHSKDQLN1TLSKC3 
LVMLNNLVKRFTSAAlGSSADFVKDbAPFKILNSLGLTISVSPS 
DSFSVLNIPtGVJCSYVLHNGBSl*SMDYiaTKDNBHFNAMTSLSSK 
LPPIIJ»TPVHHSTRDKIPI*TKVGRHI»YTVRHHESGVERSrVOC)I 



0TVBGSKKVTIRSFVQIRNHFSVPLSVYBGDT1jW3TAS pbrbfh 
IPLGSYRSPIPLKPBDBHYQKCEGIDFBBIIKNDGALLKXKCJtS 
KNPSKBSFLIHIVPKKDHLTSLSVYSBDGWDLPYIMHLHPPXLL 

LDYI^iJHDWKSEyHIKPBQQDISPVSPTCVTSMBKTDLDIAVHMT 
iHTCQTVVAPHSPlTKPfVNKTCRMLQyKAJDOIHRKHPPinfKKlVL 
PSP0PMHPPHMHKVQLWVTDSBL3NQP3TDTVGS HG AVKCBGLK 
W>YQVGVTIDLSSPNrrRrVTPTPFYMI KNKSKYHlSVAEEGSD 
KKLSLDLBQCI PFKPKTASSKLLIQVKRSEDPPKRI Y KNKQOJC 
ILLRLDKRIjGGXIABVNLABHSTVITFIjOYHDGAATPLLINHTK 
KELVQYNQSSLSBIfiDSLPPGKAVPYTWADPVGSRJRLKWRCRKS 
HGBVTQXDDMMMPIDIGEKTIYLVS FFBGLQRXILPTEDPRVFK 
VTreSRKABLABQHIAVALCPVOISLVNNYTKQBVAyiOrTSSD 
VVWBTKPKKKAWKPHSVKHTBKLBHBPHBYTBSSPSBDKyiQL 
DTPVPVRLTPTGHNMHXLQ PHVIAl^UtNYLPALKVKyifTSAHQS 
SPRIQrYRlQlQNQIHGAVFPFVFTPVKPPKSVTKDSAPKPFTD 
VBTVMRSAGHSQISR IKTFKVIaIQEHOLRLOLGPXXALTDLMTB 
ABVTBRTBVBLFHKDlBAFKEKYKIASLVDQSQVSLYByFHISP 
I!0mSVSLSSGRKKAKDSK0KGGLIPVHSIJ3MiLXSIGATLTD 
VQDVVPKLAPPKIJWQFHTTSDLQSKVIHHYSKOAIKQMTVLIIj 
GLPVLGNP reLIBHPfiBGVBAPFyBPYQGAIQGP B BP VB GMRLG 

lbalvggavgglagaaskitgahakgvaamtmobdyqqkrkbam 

WCQPAGFREOITRGGKnLVSGFVStSXTGTVTKPIKGAQKBGAnG 
FFKDVGKGLVGAVARPTGGI ICMASSTFQGIKKATBTSSVBSLR 
PPRPFNEDGVTRPYRLRDGTGKQMIjQKIQFyREKIHTKSSSSDD 
DDDDDDDDBSDLNH 



KPGMAGKGSSGRRPIiLLGLLVAVATVHLVICPYTKVEBSFNLOA 
TKDLLYHHQDLBQyEHLEPPGVVPRTFLGPWIAVPSS PAVYVL 
SLLBMSKFYGQLIVRGVIXJIiGVIpOLNTLQKBVRRHFGAMVATM 
FCWVTAfXJPHUIPYCTRTIiPHVLAXiPVVIiLAIiAAWLfiHBJiARFT 
HL3AFAI 1 VPR V3LC1#PIC1*LI4 iIjAL©?RKVSVVRAI»RHAVPAG 
ILCLGLTVAVDSYFWRQLTNPEGKVLWyNTVIiinCBSNKGTSPLL 
imTSALPRGLGCSLLFl PTX^VDRRTHAPTVLAIjGFKAL YSLL 
PHKKLRFI XYAFPKLN ITAARIXS YIJJWYKKSI^YKAGSLLVI 
GHLVVMAAYSATALyVSHPNYPGGVAMQBLHOLVPPQrnJVIiUn 
DVAAAQTGVSRFLQVNSAWRYDKRBDVQPGTGMLAYTHI LMHAA 
PGIjliALVRXJTHRVLAfiVVGTTGWfiUlljTQIjP PFNVHLQTKLVLL 
ERLPRPS 



HSRGDSLKBPTS IAESSRHPS YHSEPSLBPBSFRSPTFGXSFHF 
DPLSSGS£LSSSUCSAQGTGP3LGQLOSrRSBGTTSTSYKSLAKQ 
1HSGSLSYDSLLTPSDSPDP3SVQAGPBPDPPL0YTSPFLSARL 
AQQRKABRHPRLVPTGPTHH3PSPVRYDSLSRHIVASXOERKKL 
LRQS P PLPGREEB PGLGDSGI QSTPGSGHAPRTSSSSDDS KRS P 
LGHTPLGRPAVPRPGXPDGLRGRGVGSPBPGPTAPYI/3RSMSY3 
SQKAQPGVSBTKKVALQPUjTPKDBVQLKXTYSKSNBQPKSliGS 
ASPGPGQPPLSSPTRCSGVKKVSGVGGriTYRISV 



RNABBKKRI*SLQRBKIIARVSIDNRTRALVQALRRTTDPKIjCIT 
RVKBLTPHUiBPPBGKGVAVXEHI I PYLLRLRQI KDETLQAAVR 

bilaligyvdpvkgrgirilsidgggtrgvvalqtlriclvkltq 

ICPVHQLPDYTCC5VSTGAI LAPMLGLFHHPLDBCKELYRKIiGSDV 
FS©fVXVGTVKK9HSHAPTDLSQTWBNILKDR>fflSAlJ1XBTARNP 
TCPKVAAVSTIVNRGITPKAFVFRNYGHFPGINSHYIjQGCQYXM 
WOAXRASSAAPGYFAKYAI/ 3 NDUJQIX 3 GI 1 J[iNNPSAIAI>!HBCKC 
UiPTJVPLECrVSIXTrGRYRSDVRNTVTYTSLKTKLSNVIlJSATD 
TEBVHIKLDGLLPPITrYFRFNPVMCKNI PLDBSHNBKLOQLQLB 
GIjXYIKRNEQKNKKVAKII^QSKTTIjQKiJJWTKIiKTDMYKGLP 
6242


198 


1310 


QHFIjPGABTWSPGAAVCTARRFPGHSLAAFPRPAAPRHAVEMGE 
SSBDlDQMFSTLLGBKDIiLIOSl/aVDTLPPPDIVPPRABFHySV 
GPKDLNBSLHMjKDQDljIiWjMADLVADISBAEQaTIOAQKESLQ 
NQHHSASLQASIPSGAASU^GTHVAATGISQYEIHILPPPPADP 
VXiDLPLPPPPPKPLSQE3BgfiQAK?U?KIKIALKKLKBAK.VRKbV 
VKVHMTOIHSTKSLWVDERQIARDVLDNLFHKTHCDCNVDI9CL.YB 
IYPBLQIBRPFEDHBNWBVLSIJi'frRDTBSKILFIJBKBEICXAVP. 
KHPQHFYIi33NEGKKE9KBTNKKHNAKNKESLLBVRLILQSGRKB 
KDVC3 IFKS PASKNNGO 


6243 


1509 


614


rsasrfsgcwshdstccccpstchsrssascprarwppssapat 
TSRASSRRIAOGPQTRAGABTBSTAM iRRKSAAJIDTRflXTCRSA 
AGTPS PTTtfTCLTDVPTGCAAVB PTARI^AAAHASTITTGCCPA 
MGQAGAGPAGHKGSEAGGGPGRAHHAHPSPLPREPBVHTGPPAH 
6 PTPGSIDPSPHLSHGSAGVTQB 6 PU 1 DFVDFI 1 LPRTRAVDPLR 
RVFFFFYOHLTPPSJQtPQPPPCHAPHPRDPPAGTKRQliIIiVPIJC 
GPPILAPILSLTPILSRWSCYFPRSRIAQGilHLS 


6244


2119 


1745 


FEHAYASQKriFlX^BSKRCKJUKUJQKTHSLWSKVWQPSKLSK - 
PTHPLFKRNNLVIWPSVAPQSLPIjWEGXFLSUfliJtSSKYLDRAYB 
bwvni ikynkrlo^kvkilrrqiaklbtrcgmqbs p 


6245


91 


“ hTb 


LSLRNAKYSFPQKLISIiPSWXDIiNCtNlCKRYIKHITNIVIljSLJ 
ICISIAFWIISMTASTYYGNIiRPISPWRHLFSVVVPVLIVSNGL 
KKKSliDHSGALGGIiWGPI LTXANFS FFTSliLMPFLSS SKLTKH 
KGBVKKRU3S EYKBGGQRNHVQVFCBGA.VPTKLRLLYKI BKG PG * 
KIPVDPSKQfYSASHMCLSliLAAIiACSAOTTHASKVGPVLSKSSP 
RLITTWBBVPVG'l'WJGVTVVGLVSSLIjGGTPVGlAyPLTQLIPV 
HDLDI SAPQWPI IAPGGL&GIjIJGSI VDSYI/3ATMQYTGLDBSTG 
MVVHSPTHKARHXAGKPIIiIINNAVHIiFSSVLIAIjLLPXAAWGFW 
PRG 


6246


1177 


359


SIJ4PWlLHDD3LKQISW}IJjCrVYTA£IFPNC5CSSLCHSSCGQHPV 
OftlHRGAVSNSUffdtKLASQMPLEHrin/QWVPHIJiSHliAnS 
HDCKGVI QKEHFLQ^FLSIJUjPKGCTreHLSNLTILWIjKLLLHI S 
SGKDGQC^ilLRlJXJCLDLLTEJ^KYKHKSSPIxLPLLI PHNVCFS 
PANKPKILANBKVITVUVAdiSSKNQHAQRIGAAAIiHALIlNYQ 
KAKTAUCSPSVKKRVDEAySIAKXTFPSSEANPLNAYYIfKCLSW 
LVQLLHSS 


6247 


3 


1678 


HSRVHGPWTBPSAGSLRPMARKQNRNSKKLGLVPLTDDTSHAGP 
PGPGEAIiIJECDHLRSGVPGGRRRKDKSCSliLV3\SLftGAPGSSFI> 
TOYNLS VVroOTPTIKAinfHBSHERRHGRPI DPDTTLTtUISVTV 
SIFAIGGLVGTLIVKMXGKVI/5ItKHTIJiANISSPAlSAM*LfrfACS 
LQAGAFEHLIVCRFIM3IDGGVAIiSVl*PMYLSBISPKKniGSljG 
QVTAI FldGVyrGQT.TGTtPKTJIyKBSTWYLFGVIVVPAVVQL 
LSLPP1.PDS PR YIxL LRKHNBARA VKAPQTFLGKAHVSQBVBEVL 
AESR VQSS IRLVSYLELLIlAP YVRWQVVTV rVTNACYQIjGGLBIR 
IWFWN3I FGKAGI PPAKI PYVTLSTGG IETLAAVPSGLVI EHL 
OTU?LIiI<XH?GLW<H i P^^ 

IASFCSGPGGIPFILTGKFFQQSQRPAAF1 LRGTVHWLSNPAVG 
LLFPPIQKSLDTTCFLVPATICITGAIYLYFVIiPETKHRTXAEI 
S0AFSKRWKAYPPKEBUDSAVTDGK3WGRP 


6248 


56 


1773 


VP PPRMMAAVPP0LB PWNRVRI PRAGNRSAVTVQN PGAALDLCI 
AAVIKBCHLVILSIJCSQTIiDAKTDVIiCAVLYSKHHEMGIlHKP'HL 
ALKQVBOCLKRIJCHMNIiBGS IQDIiFELPSSNENQPLTTKVCWP 
SOPVVBLVLMKVMDUTKUJliRIiLDCCCKTPIiTVHHIiGLQE PI I 
UJLVMVm#VSRLSfVX*YKBVLKRLlI*LYKPIiPGIIjQKVARIQPMP 
YPKDFTPPSDlTBFLGQPYFBAFRKKMPlAPAAKGINKLUiELF 

i likeqspraseetllgiskkakomkihvqhnvdigqpvknkrvf 



KBBSS BFDVRAFCNQLKHKATQBTSFDFKCSOSRLKTTKYSSQ K 
VXGTPHAKSFVURFREAESFTQLSKBIQMAVVWaiSKKLKAQAI 
FLQ^KLLKSNRIiKHLBRQGTSLPKKLBCXICrSIO iHLIiSGSQ XK 
TSKHHLRQHSSQSKFLRKQHXPQRKL0STLLRBIQQFSQGTRK3 
AIOTSAKSraLSHCTVHRTDIiYPHSKQLiaBGVSMPVTQTKBKMI 
HKHUK?IHENBTI>S®m^IBHM3TSGTIKBTDOlDDIPM«0V 



VPPPRMMAAVPPGIjBP*CfRVEiPKW2iBSAVTVQ£fPGAALDLCI 
AAVIKBCBLVXLSLKSQTIiDABTDVIiCAVLYSNHNHMGKHXPBL 
AUDQVEQCLKRLKNMNLBGS IQDLFBLFSSUQtQPLTTKVCWP 



LNLVMVGIiVSRI*WVLYKGVI<RRLILLYBPIiFGLLQEVARIQPMP 
YFKDrrPPSDITKFLGQPYFKAFKKKMPlAPAAKBIMXIiUJKLF 
UHBQSPRASBBTLIjGISKKAKQMXINVQSHVDLGQPVICHKRVF 
KBKSSKFDVRAFCNQIiKHKATQBTS PDFKCSQSRLiOTKYSSQK 
VlOTPBAKSPVQRFRBABSFIGLSKKICWVVircSSKKI»KAQAX 
PIA3HKLLKSNRIiKHLKAOGTSLPICELEClKTSICUHLLSGSt31K 
TSKHHLRQRRSQHKFUtfiQRKPQRKlX^TLLfiBIQQFSQGTRKS 
ATDTSAKWRL3HCTVHRTDLYPHS KQLLNSGVSMPVTQTKEKMI 
hkhu^qihehktdswtvmqikkhstsgtirbtpp i edi palhov 



LAALHXMALPFRKDLKKYKDIJ3KDKLI/3NL3KTKIiKQIiKTVLDD 
LDPBNALLPAGPRQKWQT3K3TI0 PPDRBHLLSYI£KKAIjBHXD 
REDYVPYTGKKKGKI FI PKOKPVQTPTBKKVSLDPKLKKALTSA 
SDTEZiCDLAAlIXS'OlHLITNTXFCEflMGSSiniVDOBHFSNVVKG 
EKILFVPDHPPNPTNVKESLKRTKEHDAHLVEVNIjNNIKNTPIP 
TLKDFAKALKTHTHVKCFSLftATESNDPVATAPAEMLKVNKTLK 
SLNVKSHPITGVGILALIDALRDNBTIABLKJDTTQRQQI/5TAVB 
LEMAKMI^KrmmKPGYOPTQQG PRTRAANAI TKNNDL VRKRR 
VBGDHQ 



TPGSGPMSAHAAASLSRAAARCZdiARGFGVRAAPPRpPRPSUPB 
PRGCGAAPGRTLHFTAAVPAGHNKWSKVRIIIHBPKDVEILSRI FS 
KLCLfilIRLAVKEGGPNPBHNSNLAHXLBVCSSKRMPICSTXBTAXi 

kmkesketyllybgrgpggssi^eai^nsshkooadihhilnk 

HGGVMAVGARHS FDKKGVIVVEVHDRBKKAVHLERALBMAIBAG 
ABDVKBTBDHBKKNVFKFICDASSIiHQVRKKLDSIXIIjCSVSCAL 
BFlPN3KVQLABPDLBQAAHL10AI>3 HH S W l HV X Wi B 



BSFCTHXAVBVGBMBZAPKPGKDVPPKKDKLQYKRKKPRRYIIBB 
BTVPTTAGASPGPPRNXKBRBLBPQHPXNAYXZiKKSRISKKPQV 
PKKPRKKKNPBSQSGIiSGAQDPFPOPAPVPVKVVQKFCRXDKSR 
KLPBSXAKTRSRXiBVABABBBBTSXXAASSBZiLLAKBPGFLBGB 
DGRDTAKIOOADXVBAVDIASAAKHFDLNLRQFGPYRI^YSHTG 
RHI*AFCX5RHGlrV7\AIiDWVTKKIjMCBINVMBAVRDIRFLHSEALL 
AVAQKRHLHXYDKQGlSLBCIKRCDBVTRLBFIiPFBFXiLA*XASB 
TGFLTYTiDVSVGKIVAAIflARAGRLDVMSCPFYKAVXHIjGHSNG 
TVSLNS PAhntKPLAKXLCHRGGVRAVAVDSTGTYKATSGLDHaL 
ICTFDLEtGTYQPLSTItrLPfiGAGHLAPSQRGLLVAOfGDVVNIWA 
GQGKAS PPSLBQPYLTHRLSGPVHOIiQFCPFEDVLGVGllTGGIT 
SMLVPGAGKPNPDGLBSNPYRSRKQRQKlfEVKALLKKVPAKLIC 
ldpraxabvdvxsleqgkkeqibrlgydpoakapfqpkpkqecgr 
SSTASLVKRKRKVMDBBHRDKVROSIiQQQHHKEAK?UCPTGARPS 
ALDRFVR 



EKFCTW IAVRVGEMBXAPXPGKDVPPKKDKLQTKRKKPRR YWEE 
ETVPTTAGASPGPPRHHinJBRLRPQRPKHAYXLgKSRISKKPQV 
PKKPRHWKNPESQRGLSGAQDPFPGPAPVPVEWQKFCRXDKSR 
KLPBSKAKTRERLKVAEAKEKBTSIKAARSKLLLAKBPGFLBGE 
DGBOTAKXCQADIVBAVDIASAAKHFDLHLRQPGFYRLHYSRTG 
RHLAFGGRftGHVAALDWVrTXKLMCBINVMEXVRDIR FLHSHALL 
AVAQNRHLHIYDNOGIKLHClRaCDHVmLKFliPFHFLLATASK
TGFLTYIipVSVGKlVAAUlARACSLDVKSQSPfHAVIKbGRBHQ 
TV^L*iS PAfOCKPIJUaijCHRGGVRAVAVDtSTGTYNATSGLDHQIj 
KI PDLHGTYQPLSTRTLPttGAGHLAFSQRGLLVAGMGDVVNIHA 
GQGKASPPSLEQPYLTOHLSGPVHSIiQPCPPBlJVLOVQirrGGIT 
SMLVPGAOBPKFDGLSSHPSHBRKQRQSrBVKAliLEKVPAEIilC 
LDPRALABVDVISIiEOGKKEQIBRbCTDPOWCAPFQPKPKDKGR 
S STASLVXRJCRKVMDSKHM^KVROSIiOQQHHKKAKAKPTGAR PS 
ALDRFVR 



HALGRRGGSQBLSAAACGCFAIJlbRAPGSGRPAIiAPGAAAFnGL 

ggaprfppsgsaagrtmllxhyrichpltvdkykigqlymiskh 

SHBQSDRGBGVEWQWBPFEDPHHGNGQPTBKRVYLN3 KLPSWA 
RAWPKXFYVTKKAHNYYPYTITBYTCSPLPKFSIHIBTiafKDN 
KOSNDTIFDNBAKDVBRHVCPlDIAamPERTYKBSHDPKHPK 
SBKXt5RCK)I^BGMRDSHOPIKCSTB3jVTVICPBVWGI<yTRVBQFV 
HKVVRDIiLIGHROAPASffVDBWTWfTMDDVHBYSKNMHEQTHnC 
V07QHSSPVDDIBSHAQTST 



pthpqqkllvslatvi fvasqkalsvbskavikqqlbsvsngwt 
V lRXABfiftSRMQHKDMAKBLYOSld/IQVASKHFYFHJJXSIiKBFS 
HABOCLTGLQETENY 35ALS CIABSLXFfHKGIASLTAASTPL^ 
I^FQCSFVKLRXDIibQAFSQLI CTCNSLKTS PPPAIATT 1AKIX. 
GNX)L0ROGRISNDMK0SM2EFR3IiASRYGDLYQASPDADSATLR 
NVELQQOSCLLISHAIKAI*ILDPBSASFQBYGSTGTAHM>SKyE 
RRMMSVYNHVLKEVESLNGKYTPVSYWHTACXCNAIIALIdCVPL. 
S FQRYPFQKIX)STSIKlJd^PSPBOTAKPIAVQNNQC2IiALKVEG 
wqhgskpclfrkiqsvclhvsstlqsksgqdykipxdnhtnem 

BQHVKPHNDtt PSTQPXiLNPAI LOTHNITVBSSVKDANG XVWXTG 
PRTTI PVKSLBDPYSQQIRLQQQQAQQPIiQQQQQRNWrrRF 



CRQAQABPAAMPRSPRSLVPSliBSTSTSVPPAKrrMATDSW 
VDBQBAAABSItSNLHUCESKJt KFDTHOAWKTHANAEKTDBERK 
BDRAAOSbliN^XRSNLVBimJaVKVLQJlDPNSPLYSVKSFKHL 
RlJtPQIdXXiVYAHGFNRPSKIQENALPLMlABPPQin>IAQSQSG 
TGKTJUVFVIAMLSGVEPAHKXPQCLCXiSPTYBLALQfTGKVXEQH 
GKpypBbKLAYAVRGNKLBRGQKISKQIVIGTPGTVIiDHCSKLK 
PIDPKKI KVFVLDSADVMlATQGHQDQSIRIQRKtjPKHCQI'tLbF 
SArPBDSVHKPAQKWPDPHVXKLKRBBBrLDTIROYYVICSSR 
DBKFUAIjCMLYGAXTIAQAMIPCTTK K T A SIfltRAgLSHIEGHQVA 
LLSGKMHVSQRAAVXSRPltBGKKKVbVTTNVCARG IDVEQVSW 
nJFDLPVDKDOiPinJBTYIiHRIQlTGRFGKRGLAVlIHVDSKHSM 
CTILNR IQSHFNKKIKRUrrDDLDHIEiOAN 



AFIPAMARblQE^jQCSVKKYQQLQKDLSKSMSaRQKLKAObTE 
HHIVKEBLAIJJX^HVVFKUiGPVLVKGBLGBARATVBKRLDY I 
rARXKRYKSQLRBbKRQSEQQRBTTAQLQOBFQRACAAKAGAPG 
KA 



: j i?!i r L‘ ): * ‘i 1 
^ " rtf < i'l ij 1 1 y? : ii iiif rt’ma 



ILEKGFPSQCHPERKHKVDDVLBSSQBHEDDUFKBLLFHNNiCIV 
SVBNGDRGSKTPIW/nT>PVSLRNYPYKICBSCBMI^LK2?lSGI>I I 
SKKinSRKKPDKFKVCEKLLLDXRHBKIPIGBKSYKYDQKRNAI 
NYHQDLSQPSFGQS FEYSKNOQGPHDBAAFFraKRSQIGBTVCK 
YNBOGRTPXSSIxKLNTSQRPRLEHRpyGCSlCEKSFCMHLRFGH 
QRALTKDSrPYKYNBYGBIPCDKSAFI IHQGAYTRKlbRKYKVSD 
BTWBKSAIiLKHQIVHW33K3YDVNEHGSlfPSKKSHIjTQIjRRAHT 
GEKyFBOGBOU K T FW K K SKbTQHQRPrPGSKPYECTSCGKAFOO 
Amino acid segment containing signal peptide
(A=Alanine, C=Cysteine, D=Aspartic Acid, E=
Glutamic Acid, F=Phenylalanine, G=Glycine,
H=Histidine, I=Isoleucine, K=Lysine,
L=Leucine, M=Methionine, N=Asparagine,
P=Proline, Q=Glutamine, R=Arginine,
S=Serine, T=Threonine, V=Valine,
W=Tryptophan, Y=Tyrosine, X=Unknown, *=Stop
Codon, /=possible nucleotide deletion,
\=possible nucleotide insertion)
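
The one-letter key above can be written as a lookup table. The following is a minimal sketch, not part of the original listing: the names `RESIDUES`, `SPECIAL`, and `classify` are hypothetical and introduced only for illustration. It tallies how many characters of a segment are residue codes, how many are the special symbols (stop codon, possible deletion, possible insertion), and how many fall outside the key altogether, which is a quick way to spot characters that were damaged in transcription.

```python
# One-letter residue codes, as given in the column legend.
RESIDUES = {
    "A": "Alanine", "C": "Cysteine", "D": "Aspartic Acid",
    "E": "Glutamic Acid", "F": "Phenylalanine", "G": "Glycine",
    "H": "Histidine", "I": "Isoleucine", "K": "Lysine",
    "L": "Leucine", "M": "Methionine", "N": "Asparagine",
    "P": "Proline", "Q": "Glutamine", "R": "Arginine",
    "S": "Serine", "T": "Threonine", "V": "Valine",
    "W": "Tryptophan", "Y": "Tyrosine", "X": "Unknown",
}

# Special symbols from the same legend.
SPECIAL = {
    "*": "Stop Codon",
    "/": "possible nucleotide deletion",
    "\\": "possible nucleotide insertion",
}


def classify(segment):
    """Hypothetical helper: count residue codes, special symbols,
    and characters that are not in the legend (whitespace is ignored)."""
    counts = {"residue": 0, "special": 0, "unrecognised": 0}
    for ch in segment:
        if ch.isspace():
            continue
        if ch in RESIDUES:
            counts["residue"] += 1
        elif ch in SPECIAL:
            counts["special"] += 1
        else:
            counts["unrecognised"] += 1
    return counts
```

For example, `classify("MAF*K")` counts four residues and one special symbol. The sketch assumes upper-case codes only, so lower-case letters and digits are reported as unrecognised.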








kphlinhqrtht^kpybckqcckte ? cvksnLtSiqrthtgbkp 

YEC33AGGKSPCBIlSALTVH®THTGKKPPICIlBC3GKSrCVKSNIj 

lVHQRTHTGBKPyK«3fECGRTPCEKaALTKHQRTHTGBKPYBC» 

ACGKTPSQH3VLTKHQRIHTRVKALSTS 


6260 


2081


1436 


"gtcpr^ 
RFVBDSFDPHINPTIGASFKrKTVQyQHKMKFLIVnJTAGQBRF 
BALAPMYYRGSAAAI IV Y PlTKHgTFSTLKNWVKKLROHGPPNI 
WAIA£3'TKCDLrBVRBVMBRDAKDTADSIHAlPVKTSAHNAINI 
NHLPIB 3SRHI PSTnANLPSGGEGFKLRfiQPSBPKRSOC 


6261 


3 


lies 


PHYRLGPGTRSRWPRRGSNAASLVPRGP3PAALVTS PCPPDPLR 
SPACKPCRPDFAPRPALLIiRSapRSAPAVTGKPAIiKEQPGPWPG 
MABVSXDQSKLPGVKBVCRDPAVIiKDHTLRHSIjQBQKIEHHLiAS 
NVQRNRLVQHDUJVAKQLQEEDLKAQAQLQKEYKDLEQQDCE1A 
QHTQWTTAT^aRBBHTQMIfnRnjjUn.TfflgBT^gWKyff^HP» 
BP PATRAXADSYYYKDGGMKPRVMKKAVSTPSBMAHRDQBWYDA 
BIAIUCLQBBBLLATQVDMRAAQVAQDEBIARLIjHAEEKKAYKKA 
KRRKKSSLDKHKQDPBWKPKTAKAANSKSKESDEPHH3KNKRPA 
RPPPPIMTTDGHDADYTHFTNQQSSTRHPSKSK3SHKGFHYXH 


6262 


2 


1759 


PBCHSQGLCSVHRPGEVPOASRISGLVLGQROBPAl^RLSQBEab 

gstrlvsqglhalrsbhqavlosi >sqti kclqqgghsbglvhkk 

ARQLRRSMKNIBLGI^SKAOVMLAIkASHIiSTVKSKKQKUiAQVRR 

LCQENQHUa^KIAGTQQ^U-QRSROAVAQLEBEXKHIiEPLGQLRQ 

yDEDGBTSKBX£GIU\TKDSl*Xn)ltFP!NBEBBDPSNGLSRGQGAZA 

AQQGGYBlPARLRlTJINWIQYAAQGRYBVAVPLCKQAIiEDIiER 

tsgh^zpuvathlnxlalvyrdc^kyrbaahlutdalsxrbstl 

GPDHPAVAATLNNLAVt.YGKRGKyKKAKPLCORATiEIRKKVLGT 
NHPDVAKQUIKLAIjLGQNOG KYKAVER YYQHALA3 YBGQLGPDN 
HJVARTKNNIASCyLKQGKYAKABTLYKBILTRAHVQRFGSVDD 
DHKPIWWHAEHRKEMSKSRHHKGGTPYAKYGGWYBCACKVSSPTV 
NTTIASIXD&YRRQGKLBAAHTIJSCALASRSQGTDPISQTKVA 
KLLGBSDGRRTSQEGPGDSVKPEGGBDASVAVKHSGDGSGTIiQR 
SGSLGKIRDVLRR 


6263 


1 


2408 


ftBLDSLADLPHRIKPPYAHGLSTSHLRSSSVBDVKIiI isegrpt 
IBVRHCEMPSVICKaTKQFQTISBBSNQGSIJt.TVPGOTSPSPKP 
BVFSNTPBKDLSNVSNIHSSFATSPTGASNSKiySADRNL I KNT 
APVNTVMDSPVHLEPSSQVGVIGNHSWKMPVDHLBrLSTRDP I C 

pksnipdqksslqspcnsknkvlkkkadflslrqtklpqiscaq 

DPASFMPPQQPCSPPSQSLSDAESlfiKHMSLSYVAWOBPGILQQ 
KNAVQI TBSAIJXHKJBSTKITraOTFVIiGDVQKTDAFVP'VYSDST 
IQBASPNFBKAyTLPVLPSKKDFNGSDASTQLNTHYTiPSKLTYK 
3S5GHEVBKSTTDTQVI SHBKKNKLES LVLTHLSRCDSDLCKKt? 
A^PBXnUdSFBQDFKHCPBSBKOUiS IEDSBSOOS ILS5LHNHSQ 
QSTQPEMHKYGQLVKVKLKENAKDDKTBNQI PQRHTRNKANTMA 
NQSXQILASCTLLS8KDSRSSSPRGRXRLTKDD1)PQIHHPHKRK 
VSHVPQPVQVSPSLIjQAKBKTQQSLAAIVDSLKLDHIQPYSSBR 
ANP YFHYIAIRKKIEBKRKLLCSVI PQAPQ YYDBYVTFNGSYLI* 
DGWPbSKICIPTITPPPSLSDPLKELFHQQHWRMKLRbQHSIfi 
RBKLIVSNBQBVLRVHYRAARTLAliQTLPPSACTVIiIJ>ABVYNV 
PLDSQSDDSKISVRDRFNARQPMSWZiQDVDOKFDKliETCIiIiMRQ 
QHKAAAIMWQRLBWQXjKLQBLDPATY KSIS IYBIQBPYVPLVD 
VNDDPKLTPI 




143 


1960


KHRQEHNAtJ3MAPKIA>rn5PfKXIBNTNGBLVAOTKAI*KILSAZ 
TQFVVWAZWLYRTGKSYLMNKIAGKNKEF3LGSTVKSHTKGI 
WfflCVPHPKKPBHTLVIiLPTEGbGDVKIDBDHQHDSWl FTLAVLL 
SSTLVYNS!«TINQaAMDOLYYVTKLTHRIRSKSS PDKNKNKDS 
ADFVSFFPOFVHTLRDFSLDLSADGQPLTPDKYIiKYSLKLTQGT 








SQKDIQfF^PRLCIHitPPPKXXCFVFDLPiHRR^AQLKKLQDB 
KbDPEFVQQVADPCSYIPSNSKTKTLSGGIKVNGPRLSSLVLTY 
INAISRGDLPCMBNXVtJdiAQ IEE1SAAVQKAIAHYDQQHGQBVQ 
LPABTLQSLI1)LHRVSBRRATBVYMKNS FKDVIIHLFQKKLAAQI* 
DKKRDDFCKONQSASSDRCSAIiLqVIPSPLBBBVKAglYSKPGG 
YOiPIOKIjQDLKKKYYBBPRXG ioabbilqtylkskbsvtdail 
QTDQILTBXEKB 1KVKCVKAKSAQASAKNVKKHQIKTO3NNKBK 
BKSYOBRVKQb78KnBRBRAQljtL8BOBRTLT5XLQBOAHVZ«KBR 
CQGESTQLQNBIQKLQKTLKKXTECRYMSHKLB[I 


6265 


141 


1960 


KlUtQBKftALDMAPBIHMTGPMCIJLENTtCGELVANPBALKILSAI 
TXJPVVVVArVDLYRTGKSYUlNKIiAGKNKGPSIjGaTVKSHTBDI 
HWWCVPHFKXPBHTLVLLDTBGLGfl)VKKGDIiQNDSRI FTLAVLL 
SSTLVYHSMGTINQQAMDQbYYVTKLTHR IRSKSSPDSNHIfBDS 
TiDFVSFFPDFVITTLRDPSUJLBADGQPLTPD'BYIiBTSLKLTQGT 
SQITOKBFHIiPRllJRXPypKKXCPVPDLPlHRltKLAOLKKLDDS 
HLDPRPVQQVADFCSYTFSNSSTXTLSGGIKVNCPRliBSLVIJrY 
IHAISQGDLPCMENAVIJdjAQIBNSAAVQKAIAHYDQQMQQKVQ 
I^ABTLQKIJJililRVSBRKATBVYMKNS PKDVrDHLPQKKLAAQL 
DKXRDDPCXQNQBASSDRCSAIiIjQVIPSPIiBKBVHAGIYSKPGG 
Yd«PIOSIiCH)liHKKYYBBPRXGlOAEBI]jQTYLRSRBSVTDAll* 
QTDQIItTKKKKBlHVBCVKAKSAOASAKMVBEWQIlCrQQMMKEK 
BIlSYQBHVXDLTBKMBRBRAOIiLEBQBXTLTSICMlBQAHVIiKER 
CQGBSTQIj(^BIQKLQKTLKK ktkr ymshklki 


6266 


276 


1421 


GS HQKQMLVlKIPljYsijONRKPSLYGSl.XCQG IGLDG X PEEVTAS B 
GFTVNEINKI«IHIBCPKBNASSm*APYTPFSimiTICBITC3J) 
ISSRGGIXJVBSSTIXJTMKIKQASRaBLRRVLEQHVPDVNCCRFP 
PSOIjVVLSGGMX^LjawSAEDASCVVTFKGHKGGILDTAXVDR 
GRNVVSASRDGTARIjWDCGRSACLGVIiADCGSSINGVAVGAADN 
SIHlGSPBQHPBEREVGTBAKMliIjbAREDXKIiQCUQliQSRQIArF 
IiFIGSDAYVCCrFliSGFliljliAGTQDGNIYQLDVRSPRAPVQVIH 
RSGAPVI>SLLSVRDGFIASQGOGSCFIVQQDXJ)YVrELTGADCD 
PVYKVATWEKOTYTCCRIXILVRRYQIiSDI* 


6267 


3 


£22 


NRYWLKGDQLYISKKKVKDEKlflQKVFDIjSDYEKCBBIiRKSKS 

RSEKNHSKPTl«ABSKQP<SrrAPm»IFIiAV5PSBKBSttXNALKSA 

ITBAKHRXXDEVTVBSDSYLAHPTRDRAKIQHSRRFPTRGKLNA 

VASrSTSDOCLTUJLIQEKDPSPKBPTSLC 


6268 


160 


1368 


HRKXjCQNLPAGJUSSALI Dtf PLTLLJ6S IDTYVMLQBFVTFQDVAV 
DFSREEHGUjGPTQffTBYRDVMLETPGHLVSVGffBTTLRNKBLA 
PHSDIPBBBPAPSLKVQBSSRDCADSSTtiDTLQGGVQBVQDTV 
IaXQJTBSAQXKDDPQKXHFDNRBSQANSOALDTNQVSIjQKXDNPB 
SQANSQAIiDnTRQfVLLHiLI PPRKRLRKRDSQVKSrtKHNSKVKIHQ 
KSCERQSAKEGNGCRKT P3RSTKQ ITPIRIHKGSQVCHCSKCGX 
I FHHFRYFSVHKKXHTGKRP YVCQDOGXGFVQSSSLTQHQHVHS 
GBRPFBOQKCGRTFNDRSAISQHLRTHTGAKPYKCQDCGXAFRQ 
SSHDIRKQRIHTGERPYACNKOGRAFTQS8HLIGBQRTHNRTRR 
KKKQPTS 




2886 


1449 


HASAPTRKNMAAAS PLRDCHAWKDARliPl^TTSNEACKLFEiATL 
TQT\nWTiroKSU^IBGCLSKIiXAADPTFVMGHAKATGI»VI*IGT 
GSSVKLDKELDbAVXTN^ISiniQPLTRREQLHVSAVBTFANGN 
FPSAJCBLWEQlM3DHPTim>ALKFSHEkAYFYLGYQEQMRDSVAR 
IYPPWTPDIPLSSYVKGIYSFGI^4ETNFYDQABKL»AKEALSINP 
TDAWSVHTWHIBEMKAB IKZ)GLBFMQHSBTLHKDSIMLACHNY 
HHHALYIiIEKGEYBAALT IYDTHIIiPSLOANDAMIiDVVDSCSML 
YRI^MEGVBVG0RWQDVLPVARKHSRDHIIiLFNI)AHFI2>iASIiGA. 
HDPQTTX)B^7TlJUXASBSPGBNCQHIjr»ARDVGI*PCCQAI,VBAB 


n 






DGHPDRVLEIJJL^IRyRIV<ljWK5SNAQR0VTNQLLIHAAliNCTS 

SVHKHVARSIJjKKEDALKPNSPLTKRLIRKAATVHLMQ 


6270 

| 


23 


2086 

i : 


SVTVnfiSeGO^PPTYHLBEMBOBP^^BPAEIiaiRBAyKKA 

PLPVNKGLWTOKLGQSBBAKHYYKOGIGHliIliGlSISSKESBtfr 

gpgkbsasqkqqkmketlgbvrtrlbilbkglatslqndlqbvp 
KLYPEFPPKDKCBKLPBPQSFSSAPQHASTW aiTSTPSAGAVAA 
PASLSLPSQSCPAEAPPAYTPQAABGHYTV3YGTDSGEFSSVGE 
BPYRMHSQP PPLBTLGLDtADBL H»I PNCWQI PFVNPAGEVSAPS 
ypGYLRIVRFlJ3WSljl3TVI2TOPPGPLQVCDWLTfPLVPDRSPVLK 
CTAGAYM PPDT>CjQABJSCFVGVVI>SS elpkddrhlfbdli^qhs 
DLPLQANWNRABEENBPQIPGRTRPSSDQLKEASGT 0VKQLDQG 
HDJVRHKGKRGKHAKDTSSBBVNLSHJVPCBPVPSKKPKBLPBW 
6BXVAlfflILSGASHV&KGLVKQABITGKAIQKGASKLRBfaOPB 
BKPVBVg P AVTR3L Y1ABQA7GGAAKVSQFLVDGVCTVANCVGJC 
BLAPHVKKHGSKLVPBSLKKDKDGKSPLDGAMVVAASSVQG PST 
VKQGbBCAAKCIVHtrVSABTVQTVRyKTGTHAQBATHHAVDSAV 
HVGVTJVWlHNIGIKAKVKK'rATQTGHTlXEDYQIVDHSQREHQ 
BGAAMVNVRGBKDBQTKEVKKAKKJCDK 


6271 

1 


32 

| 

1 


I55a 


(XaTVKTAGMVGRBKKLS 1HFVPGS CRLVKBBVKI PlfRJR vLVTCA - " 
TGLLG5iAVHK8FQONNVTHAVGC3G PRRARPttFBQVHIJjD SHAVHH 
1 IEIDPQPHVIVHCAABRRPDVVEBQPDAASQIiHVDASGHIiAKBA 
AAVGAFLIYISSDYVFIXnWPPYHKEDIPAPLNLyGKTKLDGIfK 
AVLKNHLGAAVI21XPXLYGHVBKLKBSAVTVHFDKVQFSNKSAN 
MDBWQQRFPTHVKDVATVCROlAEKRMM)PSIIC3GTFHWSGMBQM 
TKTKKACAX?mAFKLPSS}fl>RPITBSPVmAQTU>RN7U5LDCSKr> 
KTIOrGQRTPPRlGXKKSLWPFLIDXIWRQTVFIl 


m 


ITTS ~ 


S75 


GAVMKDAAAPGRTBGVliKRQaAPPAAGaGGALVELTPT PGGLAL 
VSPYHTHRW3DPI£)LVALAEQVQKADBPIBAHATNKLTVIAKQX 
QE[LQBQARKVLEDAHRDANliHHVACEnVKKPGITIYYIiYKRB5GQ 
QYP3 1 ISPKEgGTSCPHDPIiGAYKLOHPLSWTP YKDIBKQDAKI 
SMMDTLLS QSVALPPCTBPNFQGLTH 


6273 


256 

1 




SCPRV^PBa^U^VMFSU>Il^PDHXRRGSC3<GRPC)DLKIA 
SAAHBSKCHPGAGAAHARQHARXI»WYDRPRYVPMBPCVKD3TPy 
HVLXBDHRI VPS CKNADGVBLYNBIBFYAKVN3KDSQDKRS3RS 
ITCPVRKWKBKVAH PRLTKKDIKPVWLSVDFIjWWRDHBGDKBMB 
IAHVEHXABVHDHTyCVLPT 


6274 


56 


1142 


AAAAMAAAAGGGAQAARSLSRFRGCLAGAU13DCVGSFYKAHDT 
VDLTSVLRHVQSLE PDPGTPGSBRTEALYYTDIXrAMARALVQSI. 
LAKKAPDKVWAHRPAQBYKKDPDHGYGftGWTV kKKUHPKCR 

dvfbparaqpngkgsygnggamrvagi slayssvqdvqkfarls 

AQLTHASSIXTWfGAJXQAIAVHLALQGBSSSKHPLKQLLGHMBD 
LEGIlAQSVIinARKLGMRBRpySSRLKKI GKliIiDQAS VTRKBVVS 
BliGBGIAAPBSVPTAIYCPLRafBPDPBXPSAPNSLQRTLITSI 
SI/3GI)TDTIATf4AGAlAGAyY(31DOVPBSWQC3SCBGYBBTDIIA 
QSLHRVFOKS 




20 




SRiaiR^CXARGSERPVPRPAKTMAPMVKTWGGQIjKNLTGSLa 

ggedkgmdksaakaqghsreeyeetqkqlvkbhmbrdaqftqr 

KABHArLRSHPRDKYRLPKHETOKSQXQKAGGXJVELPRBLAKMI 

BBDTBRBKBKASVLGQLASLPGI^LGSIiKDKAOATLGDLKOSAE 

KCHVN 



627$ 797 97 TIJ^bPPLPDTBGHILI2raffl»BGTVAEHPVPIVHTPSGNXLTLB 

SCIJOOIATHPGHTOIHLQIABPAAIJU’SLMjIiABXSSLGLLHWP 

VWVGAKISHGSPSVP^VftGHKLLTAVAKVFPHVTVAPGWPKBV 

LGSGYEH^IiTDMlJSLOQGIiffQFVSFQHQAMklXiHSTAfSVIGRL 

iasspbatvtvehnpaggdyasvrtaliaaravdrtrvtxhlpq 

GYHKDLLAHVGRK 




















WO 01/53312

PCT/US00/34263



Predicted beginning
nucleotide location
corresponding to first
amino acid residue of
amino acid sequence

Predicted end
nucleotide location
corresponding to last
amino acid residue of
amino acid sequence







MAFRTKMGLYYSYFKTIVBAPSFRfGVKMIMWDKLTPyPLVTNT 
LKRi^YPgVlljASWYRIYITCOfflLIGIQfTK ICTTV TIGBGLSP 
TBSCBGIGDPACPYVAVIFliroumALKPl YGTYLSGSRLGGk 
VTVLCFFFSHGKCTSVKWTPPIRBSPSYPFLVI^JMLLVTHIIAA 
TKtYRGS3jIALCISNVFFMLPWQPAQPVLLTQIASLPAVYVVOy 
IDICKLRXIlYJlOCUnALCPVIjHPGilSWjLTSYYASSLVlIWG 
IMMXPHPI^HVSKI^UAaQGaTniFGTmjnrLTSKIPGXA 
NDAHIGNLLT3K?PSYKDPiyrLWrrCAABFTJPMRK3TPLR'rrKr 
bhLPWLVGPVAIVRKI I SDHWGVIAKQQrrHVRXHQPDHGELVY 
HAI^ZJAYTAIjGIIalMRX*KIi?I)TPBMCVMASl*lCSRQLrcUIiFC 
KVHPGAIVPAILAAMSIOGSAffljarOSHIVGBPSHLPQEELIBW 
IKYSTKPDAVFAGftMFTHASVXLSMAPIVHHPHYEOflGLRART 
(dVYSMYBKKAAKEVKRBLZKLKVHYYIlSBSKCVRRSKPGCSM 
PBIWDVEDPAHW5KTPLC!JI»I»VKDSKPHPTTVF®ISVYKVLBW 
KB 



ILPRLVLLShVYLIjNSVATKBIUQPABWLdVBGOQYAVVGTVLIJL* 
XRXXIigYCQGVDglPSVTTDMIiTRLSPIiIiKYFM aRSCO LVLGAG 
AliQVVGLKrnTKHLALSSRCIXiljIVHYI PVTRAHFEARLPPKQ 
ySKLRHFDHTTiaJYmHlABISAKLVAIMDSLPDKLbSKYBVKR 

pvpsacprnxckqktkmhhaipdllpeeotqmlplrxnasyklh 

LKKQLSHI^VlNPG G PQMGLVTADVAFYTGHbQALECgliKDIiDUT 

MABIWBQ3R 



GGAMASDGARKQPWKHSNSKLPGSlQUVYGAOdPPFDPnUiGTti 
LASTAKMPTTPVKAKRVSTFQKFESNTSDAlfDAGBDDD KLLAMA 
akslnsevvmktanrvlrnhsqrqgrptlqbgpglookprpbab 
PPSPPSGDLRLVKSVSBSOT 3CPAB£31SDAAPLQRSQS1J>KSAT 
VTU3GTSD PffTLSSSALSHREASRLDKFKQU^AG PNTDLEEXRR 
LSWSG I P KPVRPMTHKLLSGYLPAHVDRRPATL0PKQK3YFAPI 
KHYYDSROTHVH0DTYKQIBlDIPHM3PEALIbQPKVTB3 FBR I 
IJIMAIRHPASGYVQGINDLVTPPPVVPICBYIKABBVIXrVDVS 
GVPABVLCHIBADTYWOlSKIiljIXSIODHYrPAQPGIQMKVKMLE 
BhVSRII>BQVHRHIJX^BVRYLQFAPWtl«IILljMHBVPWlCTia 
LWDTYQSKPDG F5HFHLYVCAAFLVR WRftRH «B KKD PQKLIiIiFL 
GHLPZAHNDDBDISIJjIABAYBLKEAFAIAPNHYKK 



B 

DTOLAQVIAYIJjKRGQVRLVQGGGAANWPIOALLDSKEENDHA 
OTX^RIvGDPYKPPVEATPDTKBLBPKBIKTGVKIATGQLGLRRAA 
QKUSFTHMLaQRKRGIiCHSGSPSIiGBQSHVISHKLPNDbGFTIJS 
YSQKAFOOIYSKDGOlPWSAOQDQTlRlYDCRYrattPHKFKSIlCA 
RDVGHSVIJWAFXXMmFLYSSBSDyiHICNIYGBGDTffMl) 
IiRPDBRHFAVP3IAVSSl)GREVI/K3AN!DGCI»YVPDRKQHRRTIO 
IBSHKDDVHAVAFAD1SSQILFSGGDDA1CKVNDRR7M&BDDPK 
PVGALAGHQMITPIDGE0G33ARYLISNSKDQTIKLWDIHRPSSR 
EGMEA£ROAATQ®WDYRBQQVPB3AHBKLKI*PGOSSLIftYRGH 
GVLHTLIRCRPS PIHSTO0QPIYSGC3TOKVVVYDLI»SGHIVKK 
LTOHKBCVRDV3llHPFBBKIV5SSHD®IJU»WQYRtJftKYFQDDM 
PB3EBCASAPAPVPQSSTPPSSPQ 



ECCTOKMGSHMSSSflGSGSGDPSEGLPaBGft GIg a S BS B BgBDB 
DVDLAQVIAYLLRRGCJVRLVQGGGAAKbQPIQAIiLDSfiBBElDRA 
WDGRLGCHYNP PVDATPIH’RELEFltBI KTQVBLATGQIiSliRRAA 
QKHS PPKMLHQRERGLCHfiGSPSLGBQSRVISHFIiPNDLGFTDS 
YSQKAFOGIYSKDGQlPNSAGQDQTIRhYDCRYGRFRKPKSIKA 
RDVGWSVTJJVAFTPDGHHFbYSSWSDY’IHXCWIYGEGDTHTALD 
LRPDERRFAVFS IAVSSIXmEV|jGGAHDCICLYVPDRB®JRRTLQ 
IBSHEDDVNAVAPADISSQIXiPSGGWlAICKVWDHRTMRKDDPK 
PVGAIAO^JTOITPlDSKG11ARYI*lSHSKBQfriKLWDXRRPSSR 



EGMEASH0AATO»nTOYRWQQVPKKAWRJaJaiPG31SSLMTYBGH 

GVLHTLIRCRFSPIHSTGGQFIYSGCSTCKVVVYDLLSGHIVKK 

LXHHKACVRDVSMHPFSBKIVSSStfDGEnALWQYROABYFQpQM 

PESESCASAPAPVPQSSTPFSSPQ 



RWAA£EALKAVLVDLOTTLH 3 BIiftAVPGAQSALKRL 8 GASvI IR 
FVTHTTRBSEGQDT.TiRRTtgKItRFDXSHDglETSIjTAARSItLKRKq 
VRPMLLVDDRALPDFKOIQTSDmAWKSIAPBHPHYQ I LNQAF 
RIJajX 5 APl*IAXHKARTYKRKDGLALGP(JPFVTAIiEYATDTKAT 
VVGKPKKTFFLBALHGTGCBPKfiAVH IGDIX^RBDVGGAQDVGML 
GIL V KJG K YRASDHEKlMPPPYLTCESFPHAVDHIIiQKLL 



LSLrciHVMHPFHSMSTSSVRKRSBGSBKILTGDVKrSPPRTAP 
KKQLPS IPKNALPITKPTSPAPAAQSTNGTHRSYGPFYLKYSIiL 
ABPTLVVKQKLPGVYVQPSYRfiALMHFGVl P 1 RHGLYQDGVFKP 
TVYI PDWYPDGDCPRLVPDIPVFHPLVDPISGKLDVKRAFAKWR 
RNHHHX 8 QVLMTARHVPYKIDTA 5 PLNPHAAVLYKKDIQLPBCSK 
WDSVKVCTARLPDQPKIRPPYAIS PSP«NPSVHDKAREKMLTQ 
KKKPKSQHNKSVHVAGLSWVKPGSVQPPSKBBKTVAT 



RSVZPGSTlSSRMPGLSfiPRFMAAHEWWFQRKKLIGQISDIRV 
(fflLQVBHBMVOKHTFrHWlULHIiBKCHPPLgVKPLFVDIQDGKI 
UlALLBVXiSGBBldABSncSSSHRlFSUlHIAXAliRFLBDlSNVKL 
V 3 IDAAHIADGHPSLVLGL I NNX ILFFQI KELTGNLSRNS PSSS 

lapgsggtdsdssfpptptakrsvaisvkdqrkaxkallawvqr 
K r RX T g VRVQDFAGSWRSOLAPLAVXKAlDPSLVDMKQALEMST 
SEKLSKAPSIAQDALHIPRUjBPEDIMVDTPDBQSIHTYVAQPL 
ERFPBLKABDIPDSDKEVPIESTPVRIKBTPSBQBSKVPVLrEN 
GB 3 lTYTVNHBTfiHPPPSKVFVCDXPE 3 MKKPiajDGVSSHAL 9 I >3 
STBFMHQI IDQVLQGGPGKTSDI 3 BP 3 PESSILSSRKEKGRSNS 
LPI KKTVHF BADTYKPPPCSKKbSIiCFEGSPRVAXgSIiRQDGHV 
IAVKVABEKBQXQBSSKIPBSSSDKVAGOIPIiVBGTNHMSQSSS 
CNGALBSTARHPKESHSXiSPPGKtrrWADSFQIKVNLMTVKALE 
BGDYPKAIETjKASKFHSDLIDPASTSQAPJOCVPSPHBTKPDEDA 

bafknhabklgxrsxjcsahxkkdspbpqvkmdkhkphqdsgbba 

BGCPSAPKBTP VDKKP jgVHB KAKRXSTR PHY KRE GSDIHJIiQQVG 
BELSSSPPSSCV 5 LBTLGSHSBSGLDFKP 8 PPZ*SKV 3 VXPHDLP 
YPPHYBVPLAAVLKAYVBDPKDLKHEBMDI£BPEaYMPDLD 5 RB 
EBADGSQSSSSSSVFGBSLPSASDQVLYZiSRGGVGrrPASB PAP 
iaphedhqqrbtkkndpmdshosqbspnisjiabplkbnvtecks 
I BSKKXBKRKHVDHVESSLFVAPGSVOSSDOIJSKDSStJYSI PSR 
TSBSnSSXYLRfiHTHRSSESimPSLCSVKHRSpSG 



SCKTHHLLaWWPQQGLSFLPSALVIWTSAAPXPSYlrAVTLHII 
XDPAIiPYISDIGTVAPBKCItPGAMIJVXAAVliCIATIYVIlYKQVH 
ALSPEEHVI IKUTKAGLVIGILSCLGLS IYAHPQKTTLPAAHVS 
GAS^TPGJCSLYMFVQTILSYQWOPKIHGKQVFHIRLLLVIWOG 
VSAI^NLTCSSVUiSGNPCTDLEQKLBKBP BDKGYVLHMITT AA 
EWSHSFSPFWFLTYIRDFOKJSLRVBANLRGLTLYDTAPCPIK 
HERTRLLSRDI 



KSCVUAMAQLQTRP^DMKKYAVDW 
PPSIPAASKIADLSHI INKUiKDKNBPHKHVBPDPLIKGQFLRM 
PLDKHNBMSNISSKKWBIBYVBKYTAPQPBQCWFHDDWISS IK 
(SiRBHILTOSYWnSKIHSIiBGKSIMTIVGHTDVVKDVAWVKKD 
SLSCUaliSASMDQnlalMXNNVBRHKVKAIiHCCRGH&GSVDSlA 
VDGSGTKFCSGSVfiDKWLKrHSTVPTDKHDHMKESTORPRKKQKT 
EQLGLTBTPIVTLSGHNEAVSSVL»SnABBICSASWDBTrRVWD 
VRSGSXJtS'TLTGNKVFNC^SYSPIjCKRLASGSTDRHXRLWDPRT 
KDGSLVSLSLTSHTGHVTSVKWSPTHBOQLI SGSLDNI VKLMDT 
RSCKRBLYDLAAH BD KVLSVPWTPTGUJiSGGftDKKIiYSYRYSP 



VKKESQSLNBPS PKQS BE1PKVTSKPVKGSLHHAQSAQ3 IK3TB 
HPARBDCUKHVS5B PVLSVQSKGVT JjJCRJGtSTiTiBQPVrVHHPGR 
HKLKKQGBTPHBVCMPSLAYGDI PBBLIDVSDFBCSLOWLPPB 
PVTTPCGHSPCKKajKRCIiriHAPYCFIiCKKSLKBYLADBHYCVT 
Q LLHKL1 VKYLPDBLSKRKK1 yDB8TAKLSHLTKNVPI PVCTWA 
TPTVPCPLHVFBPRYRLMIRRSIQfrtSrXQFGMCVSI/rQSISPADY 
GCM1^1HHVHPIJDGRSVVI)TVGGKRPRVI»KII®4KDGYCTADIB 
YLEDV 



VTLYPCRGLVGNLLLGASGMASGCKIGPSIIiHSDLAIILSiECLR 

MUSSGADYLHLDVMDGHPVPNITFCTPWKSIjRKQLGQDPProM 

HMMVSKPBQWVKPMAVAGANQrrFHLBATKMPGAI^I XDlRffl*34 - 

BVG1AIBPGTS V BYIAPMANQIPWAI«VHTVBPQPGGQKjWB1*M 

PKVHHIJnX)PPSXlDIBVDGGVGPDTVHKCAEAGAKMIVSGSAIM 

ftSBDPRSVXNLLRNVCSHAAQKRSLDH 



YPCRGLVGNIiLbGAGGMASGCKIGPSll^SDIiANliGAKCLR 



»Vc* ■> »yj * : i r»i\ ; i *)' \ 



H^IVSKPEQHVKPHAVAGANQYTPHIiBATEHPGALIKDlRWJOf 

KVtSAXRPGTSVBYLAPttAROXDHALVMfTVBPGFGGQKPHBIMM 

PKVHWLRTQPPSLDI BVDGC5VGPDTVHBCCABAGANMXVSGSAIM 
BSBDPRSVINLLRNVCSEAAQKHSLDR 



XCGRWlLdVYETVAFnAaJ^RUUQUUmiUtRIWlSRYn^ 
VPQSLSbKGITKH7iLNHHPPPBKLBBISPTSD3HKKDT8SOSKS 
DITRKSSPTSADTGNSLaAPPSYTGAGISTEGSSDFSTCYGELD 
QKATBKVljaMPTAIDBIiLYBQXI^VHTKSLQBBCQaWtASFPHL 
RlbGRQIITPSBGYRLYPRSPSAVSASYBTTbSQKRDSTIPGIR 
GKKLHFSSSYAHKASS IAKSSSFCSWERDHEDS I IVSBGI IRBY 
LAPDH1D1BBGPHGKKSRAAXHKOKLGYPP1APFYO(KBDVLAY 
VPDSVWCXWSCKEOIiTRSHHEGPASDDBSHVAVTRPDSBSSCV 
L2BLHPLVLPKVPQS KVLYITSNPHS LOQASHHQPNVKDbLVHG 
WPLOPRNLSIifl>ICLLDI>I>DKliLMRPGSSTII*STRHHPNRAVEFS 
TSSLSYTVQSTRRRNPPPRTLHPISTSHSCAETPRSVEBILRGA 
RVFVAPDSLSSPSPTFIiSHKNIiPPXG’EAKVKHVSTVGPQRQ^K: 
PHaDSSRAGSAVVDBPHYQOPQKHLLliPDFPPRPKrrQSFIjUTr 
QfRRSCAVBYPHQARPCTGSAGPQLHGSTKSOSGQlPVSRTRQG 
P 



LVAKMASSASJOnrPAGiOmHQKBLRRL«KBKGRL3TSRKRIB3 
PRAKYMRLGQLS CAI^NTPVKSSLU#QTHVIi3KQHRBKVAELKG 
AKRASQGSSASSAPQSVKRKAPDADDQPVKRAKA TLVPqVQP ST 
SAWTTHFPKEQKBPlRRTPSKPSGIiSltLPPYgPEBgBHgBSgGD 
GBRKRGEASKPbSDAQGKKESVSSSBSVrSSVLP3SDFPSTNPPK 

API iphsgsiekabihbkwkrrbntaealpbgpfdppbvdarv 

RKVDAPKDQKDKEWDBPQKAMRQVNTISEAIVABKDBEGRLDfiQ 

IGBlDBOIKCYRRVBKLRKRQDHlKKKLlCKrLTIKBLQXKKBEN 

ADSDDBGBLODLLSODMRVKGALL 



TCPGAMKMVAPWTRFYSNSCCIiCCHVRTGTIIjIXJVWYI*! INfiVV 
T.r. TrJ^A rAnpnQYTTPfiA^pLGGDyKywuitftfMClAIA rSLLM I 
IiICAMATYGAYKQRAAHl I PFFCYQlFDFMJmVAlTV LIYPH 
SlQEYISQXPPNFPYRDIJVMSVNPTCLVbllLLFISIILTFKGY 
LISCVWCYRYINGRNSSDVLVYVTSNDTTVLLPPYDEtATVNGA 
AKKPPPPYVSA 



FWCTLGTVDVBPIGHCAINSKIl»VPPRTIHAKFTDWKGYLrtKRI» 
VGSRTLPVDPHXKMVBSMKYPFRQGHRLKVVDKSOVSRTRriAVV 
DTVXGGHIiRIiLYBDGDSDDDFKCHMRS PLIHFVGWSRR VGHG I K 



6294



3S4 



6295 



2795



6296 



727 



6297 



KISERRSDMAHHPTPRKXYCimVPYLFKKVHAVYTBGGWraBGMK 
IxKAIDPLNUMCV7mnarVLIJXryi^IC^DGGPSTDGLDWPCY 
HASSHA1 FTATPCQKHDIBLTPPKGYKAQTPNKBHYLBKTKSB3V 
APSRXtFHMDCTNHGPKVgCCl^AVPLMBPaLIC VMV KRV V HRl, 
LS IHFDGWDSBYTQWVDCBSPDrYPVGWCHLTCYQljQPPVAAKP 



ATPIJCUJBATKKKKKQFGKKRJQUPPTKTRPLRQGSKXPLIiKDD 
PQGARKlSSSPVPGKrXAVBYKKKHLDVASPDXASSPBIjtPVSVE 
N1KQBTDD 



1814 



AQLTTRGRTVAGGVRWIPSPFPDLBLYSCCtiGTDfeGFPKLSHHC - 

KNVXRTASDYDPftBITNIRPSFDVSPVVAGLIGASVLWCVBVT 

VFVWSCCHQQABKMHKNPPYKPIHMLKGISIYPETLSHKKKIIK 

vrrdkdgpgreggrrnllvdaabagllsrdkdprgpssgscidq 

LPIKMDYGBBLRSPITgLTPGK3KTTSPS3PBKDVKLGSLTPSV 
DYNPPKKALWTIQEAHGLPVMDDQTQGSDPYIIMTIIiPDXRHR 
VBCraVLHKTLDPVPDBTPTFTGIPYSOMIDLVIflPLVLSFDRFS 
RDIT/IGEVMVPIiAGVDPSTGKVQI/rRDI I KRHIQKCIS RGKLQV 

slsyqfvaqrwtvvvlkarhlqkwdiaglsqjpyvbcvnvyygrk 

R IAKKXTHVXKCTI2JP I FNES PIYDI PTDLLPDXSIEPLVTDFD 
RTTKNBVVGRLll/SAHSVTaSQAgHHRgVCgSPRKPVAKW HSLS 
BY 



617 



V88AlXrOATSG3DAAKSEGASASPI^ramV7U1DRPDBGPPAK 
TRHLSSSKSPQRDPPPPPPPPPLIiRIiPLPPPQQRPRLQKKTEAA 
QVIADMRGTOLGPADPPPPPYVILKSGGIBAYFTLGMCPGHDS 
TIESGYGBAPPPTKSLEALPTPEASGGSIiBIDFQWQSSSPGGK 
GAIiBTCSAVGWAPQRLVDPKSKEKAlirVKDKDRDKRKSMRflSR 
HlffiRRRRRKQRKVKRJKRERNAKRHBSILOALBDIQLDLBAVfrr 
KAGKAFI^ILKRKPIC^CRRPPLERRDIJCIQHIPGPWVKAFLNHPR 
IS IL INRRDHDI PRYljTNIjQVDDLRHXSmYKMKLYFOTNPYPT 
HMVIVKEFQHNRSGRLVSHSTPIRWHRGQKPQARRHGNQDASKS 
FFSWPSNH3LPBADRIABI IKNDLWVNPLRYYIjRKRGSRIBCRKK 
QBMKKRKTRGRCBWIMBOAPDYYAVKDI PSBISDIDBTXHDIK 



ISDFMBTroYFBTTDNBITDINENICDSENPD'HHBVPNNBTrDN 



nksaddhettkojesaiiijhnbnpkijnnkntuunbbnpnnnknty 

GNNPFKGGFWGSHGMNQDSSDSDNBADBASDDBnNDaNBGCNBO 
SDIHX32BGlHiEGSDDDDRDIBYYBBVXBDPDED0ADYBDVlBI I 
SPBSVKKEGIBBOIQQDHDIYBBtayYEBE GS K D V WB BGgDSDDS 
DIBDVLOVPHGHANPGKRGKTG 



1199 



RHOXIDAQGACDSI^ETrtJrSSPVTARNAIPBARCCVHlJLDGTTV - 
BAVRPARBRLARKKLRQKra^QQPSRDSAYSSHKDSTCLLTBRDT 
LGTSDQPPSPFSGTXSFGSFSDS&IFPLGSQCCLGFQQPSISGK 
KHALIHKRVRLSVFGAIflfQR X Y PGK 



922 



6298 



QRAAAAS PSSCGPHGABYGAWlAMBGnnfRPLALLGSAbljVGPZiS 
VI PAIiVffiftjHYRBGCLGHDCiSALBFNWHFVttfVTGFVFIQGlAI I 
VYRJoPMTWKCSKLLMKSIHAGLNAVAAII^ ISWAVTENHNVN 
NIANMYSLHSWVGLIAVI CYLWJIiLSGPSVPLLPNAPLSURAPL 
MPIHVYSGIVIPGTVIATALMGLTBKLIPSLRDPAYSTPPPEGV 
PVNTI/JIiLILVFGALIPWIVTRPQWKRPKBPNS7XLHWJGGTBQ 
GARGSMPAY SGNNMDKSDSHLNNBVAARgRHIALDEAGQRSTM 



985 



SVPIJUa^I^GXLQGAGTTTKMAVARIAAVAAWVPCRSWGWAAV 
PFGPBRGLS VLLARI PQRAPRWLPACRQKTSLSPLNH PDLPNLA 
YKKLKGKSPGIIFIPGYLSYMNGITCAIAIEEFCKSLGHACIRFD 
YSGVGSSttaiSEBSTLGiOTRBirVLSIIDDLADGPQILVGSSLGG 
NUfliDVAlARPBKWAUGVATAADTLVTRFNQLPVBLEKBVBM 
KGVWSHPSKYSBBGVYHVQYSFIKKABHHCLIflSPIPVHCPIRL 
I^MKDDrVPSHTSHQVAGRVIiSTDVDVlLRKHSDHRBCREKADI 
QLLVYTIDDLIDKLSTIVN 







BCDLEGIMPHVTXSLSLPTHGSPLQDIXVHPCVTSLDSAI LTSS 
SinAMDESAPSGPYXPPPTPPLBSPNLCFYTSQVPVPPIIiGPYQ 
nkkbkvqlrekh 



AAPSCNSQKGVPAAGTPSSPRLLVSRAAAPSAGPMGAfifSQGARA 
AQS P?S IPNSSSVPYGSQDSVHSSPBDGGGQRJKIPVGGSPGGPR 
LVIGSL PAHLS PBMFGGPKCPVC SKPV SSDBMDLHLVMCLTKPa 
X7fNBDVLSKDA23BCAICLBBLQQGSTIARLPCIiCIYBEGCrDE 
WFBVNRSCPEHPSU 




YWtBYGGGIJLWQSMQKXHPGQALSSBPMWFPDTKBBWBQHYSQL 
WYYLEQFQYSKAQGWTPOASQGQyiDTyTSKTXADDKNDBICQ 1 ! 
KVDLVSPLSSPIMGI3NDSSGTSDKDHSKIIlK?I$NIKIli3EBVT 
QSQLDS CTSHDGHQQLSBVS5KRBCPASGQSBP RH GGTOBB3HS 
SGNTOTDPPAEDSQKSSGANTSKDRPHASGTDGDE3KKDPPSHK 
PSKLKRSHELDIDBNPASDPIffiGGSLLOFKXGSGQXXGGl PNFS 
HRQVa?IiBKKVBjLKSKYIiDMRRQIXWKNKHIFPTKKSHKPFFKK 
SKIL3KVEKPLTWVNKPMDKKASQSSSSHD5IGHDASTSCDSKBQ 
DMSVKKGM3LLKTMNPBPBKC30SVSSACTIiBTH!lYKRDSUiATV 
PDHODCVTQEVPDSRQABTBAEVKKKKNKKXNKKVNGLPPBZAA 
VPKLAKYXAQRYRLPSRFDDGIKLDREGWFSVTPEKXABH IAGR 

vsqsfxcdwvuapogvggntiqfaltgmrvxaididpvkiala 

RNNABVYGITVDKIBPXOGDPliLLASPLKADWFLSPPWGGPDYA 

TABXFDIRTtMSPDGPEIFRLSKia'niNXVYFLPRKADIDQVAS 

U\GP<W^IBQHFLSNKLKTITAYF3DLIRRPASKT 



HRARVSKSRESPG(H)LRHPGRVRRDITLSOHFRLSrQHVVLLRE 
DEVGDPGTKXELOIPGBGSPIQBIDSEWTLVBPIjPGSDMAALPA 
WRATS GLTLWPHTAEGRDIiLGABNRALTGGQQABDPTLASGAYO 
WPSSVKKIXJGSVWCimRTIXSSSJPKJGQAPPWLTDaDVQMLRLL 
AQ3EWDKARVPABSQfVLQVGPSTBAAL(X>LSRPRLSQLCSQGL 
CX1LXKRPGDLPBVLS PHVDH VLGLimSLPAVARRFHS PLLPYRY 
TDGGAH PVXWKAPDV^O^DP DBDQNSIJUjGWX^ YQAIiIjAHS CK 
HPGQAPCPGIHBTEHARZALFDFUUQVHDRIiDRYOOQFSPSPSD 
PCVBBRLREKCHMPABKUiVH ILVRSSDPSHLVYIDNAGNLQH P 
EDKUfFRLIiBGIDGFPBSAVKVLASGCLQNMLLKSIiQMDPVPWB 
SQGGAQGLKQVTjQfTLBQRGQVLLGHIQKHHLTIiPRDKDP 



PTRPSKVKVP9TFLIHSYTRPTVGQACKICLLXGLFRQGLQCKDC 
KFUCBXRCATRVPBDCUSALIBGDVPMEEAXDPSBADKSAIMD 
BSSDSGVI PGSHSBNALHASKBKEGEGGKAQ5SLGYI PU5RWQ 
SVRH’ITUKSSTtLREGWVVHYSWKuTLRXKHYWRLDCXCrrLFX) 
HNTTNRYYXEIPLSHILTVESAQHFSLVPPGTNPHCPBIVTANA 
TYFVGEMPGGTPGGPSGQGAEAARlOWETAIRQAmFVXIjQDAPS 
APQHAPHRQASLSISVSH3QIQHXVDIA3VYQ1PPDBVLGSGQP 
GVVYGGKHRICIGRIVAVKVIDKLRPPT1CQBSQLRMBVA1L0SLR 
HPGIVNLECW?BTPHfCVFVVMEKLfiC33MLKKn*SSKHGRLPERlj 
TKPLIT01LVAIJlHLBFKKrrVHC3)LKPENVIiLASADPPP0VKLC 
DPGPARI IGEKSFRHSVVGTPAYIAPEVIXSQGYNRSIJ)MWSVG 
VrMYVSLSGTPPFNKDKDXNDQIQNAAPMYPASPWSHIBAGAlD 
LDJin^LQVKMRKHYSVDlCSI-SHPWljQKYQTVLDTiRELEGKMGKR 
Y rTHKSDnARWEOPAABHPLPGSGLPTDRDLGOACPPQDHDMOG 



6307


2I3? 


589 


CFLLPRGRDPKPPEAGAAAPCAPGAPDMSFRJCWRQSKPflHVFG 
QPVKHDQCYBDlRVSRVTWDSTFCftVHPKFLAVIVKftSGQGAPL 
VI*PI*SKIGRIDKAYPTVOGaiTGPVLDID»CPHNDBWlASGSKDC 
TVMVKQIPKNGLTSPLTKPVWLEG3TKHVG1 IAKHPTARNVLL 
SAGCDNVVI*IliHVCTABBLyRliDSIiHPDLlIHVSWNHHGSLFCS 
ACKDJCS VRI IDPRRGT’LVAKRBKAHEtS^RPMRAI FLADGKVFTT 
GreRMSKRQLALiTOPBHLgBPWIORIISSHaALLPPYDPDrrSV 
VYVCOKGDSSlJUfPEITESPPyiKFLWrPTSKBPQROffiSMPKR 
GLBVSKCBIARFIKLHBRJCCKPIVMTVPRKSDLPXKfflCLyPDTAG 
PEAALEABBWVSGRDADPI LISL&EAYVP5KQRDLSXSRRNVLS 
DSRPAMAPGSSHLGAPASTTTAAnATPSGSLARAGEAGECLKBVM 
QBIJiALfiALVKEQCTRI CRJLEBQLGRMKNGHA. 




■ 


1118 


GRPTRPKKHIiI^LVIiHTYSMRYLIjPSVVlJkGTIAPTYVIJlWGVWR 
LlfiAFLPARPYQAUSDRLYCVfOSIWLPPPBinrrGVQIliLYGDL 
PKNKKNI IYLANHQSTVDW IVADIlAIRQHALGHVRYVIiKEGLK 
HlfPbYGHYPAQBGGIYVKRSAKFNEKEMRHKLOSYVDACTPMYI* 

vifpegirynpkqtkvlsasoapj^RQiavlkhvltprikath 

VAFDO«NYIiDAIYDWTVVYEGIQ>DGGQRHBSPTWrBFLCKECP 

EIHlRIPRIDBJIDV/PKKQKUMRSSfLHBRPBIKDKMUEFYESPD 

P2RRB3lFPGKSVHSKLSIIOCTI*P8MLIL9GLTaG>OlTDAGRKI> 

YVNTVIIYGTLLGCI.WVTIKA 


■ 


mm 


563 


LVABVXBPCSLPnLSVDHENKENGSVOVKHSMEKrGRPPDPADWA 

VMDVVNYFRTVGPKBQflSAFOSOBIDQKSLLLMTPNDVLTGLQL 

KIAPAL1CCYBYKVKPLQTKHLKNNSS 


6310


36 


979 


GPRCTKiq^LsSVNCBTLRIOKRWPQSSGQBRYWTPRTHSSASS - 
AQRGSLARLHVAAAGLHADCDQPIiYDCPHCGXjI ctnyhxlqehv 
dlhlbenspqqgmdrvqcsgdlqiahqlqqebdrkrrseesrqk 

IBKPQKLQRQTOLKISGQYKQQQUfflMRIBVNRGRMPPSBFHRR 
KADMMESIAIXJPDDGKTKTSGI IKALHRYYQNAATDVRRVWI*SS 
VVDHFHSSLGDKGHGCQYRNF0MLLSSI»IiQHDAYHDCr>Kt3(I*I P 
CIPKIQSMIKDA>fKEGPDPOGASQPLIISLQGTKAJfIGACEVYll> 
LTSLRV 


6311 


1 


675 


PVWKNSCBGPJdJiAAARTGHGVGRRARIAC3jGBPRVltAAV>ajTL» 
AS KLKRDDGLKGSRTAAXASDSTRRVSVRDKIJWKBVAKLEANIi 

pctckvhppdpnklhcpqltvtpdbgyyqggkfqfetbvpdayn 

MVPPKVKCLTKIHHPHITETGBICLSLIiRKHSIDGTCHAPTRTIj 

KDVVWG32?SLFTDIiLNFDDPLNIRAABHHLRDKEDIT£NKVDDyi 

KRYAR 


1 


213 


1400 


GDKLVKKBAGMKMLPG VGVFCTGSSAR^VPLLHABG PTVRAL W 
GKl^KRAKQIJiKSMHIAFYTSRTDDrLLHQDVDLVCISI PPPLT 
RQI S VKALGIGKNWCBKAATSVDAFRNVTASRYYPQLM5LVQN 
VLRPLPAPVRMRQMSBHYVOAVHICDARIYSGSIiLSPSYGWIC 
DKlifGGGtILHTMGTnrVDIJjTHLTGRRABHVHGIiIJCrFVRQNAA 
IRGXRHVTSDDFCPPQMLMGGGVCSTVTMFNMPGAFVHEVMVV 
GSAGRLVARGADLYGQXHSATQKKLLLRDSLAVGAGLPEQGPQD 
VPlXYLKfflCVYMVQALROSPQGQGDRHTWDRTPVSMAASFBDGL 
YMQSVVDAI KSSSRSGBHEAVBYLTRHFD'IHQHLCEALQRHNIt 


6313 


2 


2071 


QRSGAARLAFLPS PPS PACVHR5Pli3 PHGCWFYFVWPM PLGVL 
FURRRAKGCIXSC3SPVEQPTAMRABEIHBCLQBFPHHHXMILD 
RLNKQRBODRFTDITLrVDGHHPKAHKAVlAACSKFFYKFFQKF 
TQEPLVKI EGVSKMAFRHI^BFTYTAKLMIQG B BKAMTDVWKAAE 
FLQMLEMKALEVRNKEHSAPXiBKNT l'GKNRAKKRKIAETSNVI 

trslpsaksbpveievexaegtikvbdegibtlbbvaeakqsvk 

YIQSTGSSDDSALALLADITSKYRQGDRKGQIKBDC5CPSDPTSK 

QVHGIKIVKLQI^EHVKDLFHCBXCNRfiPKLPYHpKBHHKSHSTK 






SFKCBICNKRyiiRBSAHKQHLWCYHLSBGGVSBCKaRTGKICIHVC 

qycbkqpdhfghfkehlrkhtgbkpfkcpnchkrfarbstlkch 

LTACQTrGVGAinOCTHKLYRCQVaiSVFHSWDQFgDHLVlHTGJ)K 

RfflCTLGDLHFMQGNELRJUn^DAHHXSBBLVTEBVLSVBTRVQ 

TBPVTSHTllBOTOKVHVI^TXQVQVDSAQVTVEQfVHPDIJXJIlS 

QVHDSHHSBLPEQVQVBYLBVGRJQTKEGrreVHVEELHVERVNQ 

MPVEVQTBLLBAXHJ2HVTPBIHNQHBRBSSQADAABAASEDHED 

AKDLBTKPTVDSBAHKAHNHDHTALPVLB 



QRSGAARLAFTJ?SPFSPACVHRSPI^FHGC3rfFYPVVVFMPl/JVL 

PHRRRAHGCTLSCSSFVKQPTAMHABBTHBCUJKPPKHHKMXIU 

rlheqrbqdrptditlivdghhfkahkavlaacskppykfpqbf 

XQBPLVBIBGVSKMAPRHLIEFTyTAJKLMIQGBBBAtiDVKKAAB 

PIXJHLKAIK7UiBVRNKEN3APLfiEHTTGKNBAKKRKXABTSNVI 

teslpsabsbpvbxbvbiabgtikvbdkgibtlebvasaxosvk 

yiQSTGSSTHJSAIALLAD ITSKyRQGDHKGQXKEDGCPSDPTSK 
QVBGIBIVKIiQLSHVKDIiFHCKKCNRSPKLPYHPKEHMKSHSTB 
SFKCKICNKRYI.SKSAHKQHLNCyHLKEGGVSKKQKIGKXIHVC 
QyCBKQPDHFGHPKEHLRKHTGBKPPBCPNCHKRFARNSTLXCH 
LTACQTOVGAKKSIKKLYBCOVCHSVPNSHBQFKD'HLVIHTGDK 
PNHCTLCDLHPMQGRKLRHHLSnAHHISKRLVTBBVliSVKTKVQ 
TKP VTSMTC IEQVGKVHVLPLlJCfVQVDSAQVTVKQVHPDLLQDS 
QVHDSHMSBLPBQVQVSYLBV0R3QTBEGTBVHVBBLHVEHVKQ 
MFVHVXJfTKLLKADLDHVTPKIMNQBKR^SQADAAEAARKDHED 
ABDLKTKPTVDSBAEKAKKBDRXALPVLB 



LGLAVNWTTLVLXSY'CPTATfiBtApyWTYLLCAIGLPIYQSLDA 
IDGKOARR^SCSPLGBIJTJHGCIISI^TVFMAVGASIAARLGTY 
PDHPPSCSPlGMFVPVCAHKQTyVSGMLRPeKVDVTBIQIALVI 
VFVLSAPGOATHWDYTIPILBI KLKI LPVLGPLGGVIFSCSNyP 
HVXLHGGVQKHGSTIAGTSVLSPGLHIGLIIIIiAIMIYKKSATD 
VPBKHPCLYUHPGCVFAKVSQKLWAHMnCSBLTI^DTVFLGP 
GLLFLDQYPNNFIDBYVVIiWMAMVISSFDMVTYFSAIjCLQISRH 
IJlIJJIFKIACHOAPBCfVQVLSSKSHQNNMD 



■ k'rfnvwytN w :I7V# 4 '4 ri:< 1 



MKKSSPSGSXSQRYSGAYGASVSDKBLKRHVABKLALBQAKKB3 
BDQKRUCQAXEIiDRERAAAKEQLTRAIIiRKRICSHBBRARAKHL 
ARQLKKKDRVLKKQDAFYKEQLARLKKRSSB FYRVTESQyQKAA 
BEVBAKFKRYBSHPVCADLQAKILQCYHKHTHQrTIJCCSAIiATQY 
MHCVKHAKQSKLKKGG 



PBAQTSAVLARBlDGHLPTTIRHHAP^2MASAQUARYGQKDS3nQN 
PDYMPKLLlIGNSSVGKTSFLFRyADDSFTSAFVSTVGIDPKVK 
TVPKNKKRX KLQXWI/raGQKRyRTTrTAYYRGANClF tLMYDITN 
BBBFNAVQDWSTQI KTYSHDHAQVXLVGHKCDHEDEHVISTBRG 
QHLGBQLGPBFFETSAKDNrNVKQTPBSLVDIXCDHNSBSIiBTD 
PAIXAAKWTRLKBTPPPPQPNCAC 



PWHPLHTLPmHPHPRPPRftBGREGADSMSHLPGIiKLRR BAPPL 
LGPliLSPPPLPAGSWHRQMLRS5IilFPITlJSAGAPCKAAGRf<NI 
IAPVRM5RVIABLPQCLRKBAALJIGHKDFHPRVTCAC5QKHRTGT 
VGFKISKVIVVGDLSVGKTCLIHRPCKDTPDKNYKATIGVDFEM 
RRFBVLGIPFSLQMIDTAGQKRFKCIASTYYRGAQAIIIVFHIiN 
DVASLBHTKQHLAnALKBNDPSSVIJjFLVGSKXDLSTPAQYAIjM 
KXnALQVAQEMKAKYWAVSSLTGKNVRHFFFRVAALTFEANVLA 
KLHKSGARRIGDVVRrNSDDSHLYLTASKKXPTCCP 



AATMRl/^QNTLlJjQKKVVLVPYTSKHVPSRyHHWMKSHBLORLT 

asbpltlbqbyamqcswqkdadkctfivldabkwqaqpgatees 

OIVGDVNLPLTOIjBDLTLGBIBVMIABPSCSGKEWSTEAVLAML 
SYGVTTIjGLTKFKAKIGQCliKPS I RMPQKLHPBQVATSSVPQHV 
Amino acid segment containing signal peptide
(A=Alanine, C=Cysteine, D=Aspartic Acid, E=
Glutamic Acid, F=Phenylalanine, G=Glycine,
H=Histidine, I=Isoleucine, K=Lysine,
L=Leucine, M=Methionine, N=Asparagine,
P=Proline, Q=Glutamine, R=Arginine,
S=Serine, T=Threonine, V=Valine,
W=Tryptophan, Y=Tyrosine, X=Unknown, *=Stop
Codon, /=possible nucleotide deletion,
\=possible nucleotide insertion)


mmm 




■ 


B ** i pi n:**i i i»j U :i .• U 4 :1 •'3SW 3TH 1 


6320 


90 


1111 


RPRIGRBKVAMAAVDSFYILYRBlARSCHCYMKAIiALVGAWYTA 
RKSITV3CDPYSLIRLUPIPRLGSRADLJKDYGRHAVVSGATDG 
IGKAYABKLASRGLNI IIJSRMEEICLQVVAKDIAiyiYKVBTPI I 
VADPSSGRBIYLPIBRALKDKDVGILVNIJVGVPYPYPQYFTQLS 
RDKLHDIINVNIAAASLMVHVVLFGMVB8KKGAIVTISSGSCCX 
PTPQLAAPSASKAYliOHFSRALQYBYASKGl PVQSLI PFYVATS 
MTAPSNFLHRCSPfLVPSPKVYAHHAVSTIi3ISKRTTGYWSHSI0 
FLFAQYMPEHLHVWGANIX/JRSLaHBALSCTA 


6321 


tSis ““ 


341 


HRXAA£/1AU!AGRLLGKALAAVSLSLALASVTIRSSRCRGIQAP 
RNSPSSSWPHLNTNVMSGSNGSKKN5HNKARTSPYPGSKVERSQ 
VPRKKVGHLVEHUD V &PVHYTAVSVLASPRHADPQISBSNPSPR 
FNEKDGHVBRKSKNGLYBIBNGHPRNPAGRTGLVGRGLLGRNGP 
NHAADPIITRRKM)SSCTXIHHPVSCaanLQFVAIKRiaXX3^ 
IPGGMVDPGBKXSATLKHBFGKBtRmSLQKTSABKRBlBBlOtjHK 
LPSQEHLVI TKGYVDDPRHTtWASfKJSTKAVNYHDBTGBIMDNLM 
LBAGIXIAGKVRNVDINDKLKLYASHSQPIKLVAEKRDAHNSBDS 
RADCHAL 


1 


2047 


1085 ^ 


NQBlLKNVBSSRTVQPHPLRPLI*3IiC3WSVDVGRHPGWTGHV3 , I3^ 
11$ IN OCJ3DORGSQQBBVT aSBPlGASIFHGQKXVLYYADALTBI 
AFVVPSPVnBSLTDSIiBSNISDQDSDSNHDLMPGILKQPSLTtiBL 
PPNHTDNLNSSQRLSPSSRMRKLPQCTE'VPPLGPBTRVSWWVB 
RYDDIQJFPLSBLWTKISTGVBTTANSSTSLRSTTLkkJkvPVIP 

ihplntglprikiqgatgjcfnnviplvdgmivsrralgplvrcjt 
VINI csjucrlrsdsyspphvrrkqkitdivnkyrnkqlbpepyt 

SLFQKVGLKNCSS 


6323 


1 


656 


PASTTDGAQKARVPLDGAPHIPRPPAGSPKGCFACVSKPPALQA 

PAAPAPEPSASPPMAPTLPPMESKSSKTDSVRAAGAPPACKHLA 

BKKlTfnJPTTVIEVYPDTTEVIlDYYLIfSIFNPVYIjNFCCLGFIA 

IAYSUKVRDKKLLNDLNGAVKDAKTDRLINITRSGLAASCIHLW 

MALSVXATHRGLRSS^ILVABPHDWNTRRPQVTPRERCPAL 


6324 


1 


20S\ 


EGAGMRIUrPCRGSLNRAfiAGALPAAARMGLBAPRGGRHKQPGQQ 
RPGPGAGAPAGRPBGGGPWARTEGSSLHSRPBRAGLGPAPGTE3 
PQABFWTDGQTBPAAAGLGVETBRPKQKT2PDHSSLRTHLEWSW 
SBLGTTCLWTRTOTDOTJfTDPHRSDLQPTQPKKASPVrQPGVHOP 
WTKLBTHGSQTOPBRVKSyiADNLWrHONSSSLQfTHPEGACPSKB 
PSADGSWKKLyTDGSRTOCHJIBGPWTKPTfTDGSOJCKQDarXAARK 
QPGTCGPQIQQDTDGSWTQPSTDGSQTAPGTDCLIjGJBPKDGPLH 
BPKPGBLLTHLYSHLKCSPLCPVPHLI ITPETPBPRAQPVGPPS 
RVEGGSGGPSSASSFDBSBDDW?^GGGGASDPBDEJSGSKPWKKL 
KTVLKYBPFVVSPaKHYPWVQLSGHAGNPQftGKDGRILKRFOQC 
EQRSLBQX^IKDPLRPPVPAnroMVLQDGQTFNQMEDIiLADPBGP 

simdckhgsbtylbbklvkarbrprprkdwybhmvavdpgaptp 

kbhaqgavtkprimqwhbtosststlgfrxegikkadgtottnf 

KKTQALBQVTKVLSDPVDGDHVTT<QKYVACTJDlI,RKAI,glSPPP 

KTHKVVGSSLLPVHDflTGLAKVmiDFGKTVALPDHQTLSHRLP 

NAHGNREDGYLKGLI»DaCLLQGLAQS 


6325 


165 


944 


GLRDP FRRKKHLKPQVKMSNYVNDMWPGSPQBKDSPSTSIISGGS 
SRLS3RSRSHSFSR3SRSHSBVSSRPSSRSRRSKSSSRSRRRHQ 
RKYRRYSR3YSRSRSRSRSRRYRBRRYGFTRRYYES PSRYRSRS 
RSRSRSRGRSYCGRAXAIARGQRYYGFGRTVYPBKHSRWRDRSR 
TRSRSRTPFRLSBXDBMBLLRIAB3NAAEALGTTNIDLPASLRT 
VPSAKBTSROIGVSSNGAKPKVSILGLSEQNFGKANCQI 


H 


238 


680 


gbpspatqqkpsatgagvlhqhfssotiyvlmgllpppwtisft 

VQTTLQPPGGLPAAPV5GRMAFBPVGRDLARRMVPRAGKRTQTL 

GARRVAAQGARPLPBDRRPKSGBRIflVTYAPCHEFVLPSVSLTA 



slarlapaggswmptqqpaapstrapkpshslsgslcalfsda 
DSGSGMBCAKLPPGPGAVGRBMTXHBKLQL aKBKKQQKKKRKBKK 
GAHPKTGSAVSAAQCQGPTRSLPSSGIQLGTPRBKVPAGRSKAB 

lraerrakqkakralkqarkgbqqgpppkaspstagbtpsgvkk 

I^BYPQVmJLLLRHLVKKPBROQVPTRKDYGSKVSLFSHLPQYS 
Rq«SLTQFMSIPSSVXHPAMVRI^LQYSQGLVHGSHARClALLfi 
AbQQVIQ UX T rPPH&KL SRDLVKKLKPYMS PUQCRPIjSASMHM 
AIKFUnCEITSVGSSKRBBKAKSELRAAIDRYVQBKXVLAAGAI 
SRPAYQKISNGDVILVYGCSSLVSRXLQEAIITEGRRFHVVVV0S 

rpw^ghhtlrslyhagvpasyllipaasyvlpevstbskdskv 
GGBKV 



UASABVTTAAARGIX2AKEBBHUTDAKIRABNGIGS5 PRGPGCSL 
RHFACBQHLLSRMX5SASPI<XIDTSVLAGVtGPABVKVBKBlPN 
KATLBVXLRPiaGLPGVABKBKKRLrmiTCRAVVljGTLWPRTSI 
TVVLqVVSDAGS LLAgJJ gAACMALVDAGVWgALFOGVACALD 
SDGTLVIiDPTSKQBKKARAVLTFALDSVKRKLIUSSTKGLYSDT 
ELQQCLAAAOAASOHVFRFYRBSLQRRYSKS 



SfiEVAAGG0TRSAMAB333BWTV 
SHPLSHKLEDOCLBTRLDNDKBMLKAIjKALSTFFVBNSLRTRRHL 
RGDIKRKSLAIHHKFVSIFKBVKKKIjBSISKDVOAMSNCCQDMT 
SRIXlAAKBQTQDLIVKTTKIiOSSSQKIiKIRAaVAnAFLSKFQLT 

SDBKSLLRGTRBGPX 'i'KUKFKALGRVKQIHMDVKVXiLRTlJQQ^ 
GLBIMEQMAIXQETAYBRLYRKAQSECKTLTQKSCDVS PVLTQA. 
MKALQDRPVLYKYTLDBPGTARRSTWRGPIDALTRGGPGGTPR 
PIEMHSHDPLRYVGDMLAWLHQAT7 \SBKBHI£AIiLKHVTTQGVB 
BWXQBWGHITBGVCRPIiKVRIEQVIVABPGAVLLYKISNIiLKF 
YHHnSGXVGHSATALLTTIEEMHLLSKia FFNSLSLHASKLMD 
KVBLPPPDLGPSSAiaiQTMiLRBVLASHDSSWPIiDARQADFV 
CfVLSCVLDPLLQMCTVSASNLGTAEMATFMVHSLYMMKTTLALF 
BPTDRRIiKKLQFQIKAHX»DTLINBQASYVXTRVGIiSYIYNTVQQ 
HKPEQGSLAHMPNLDSVTLKAAMVQFBRYLSAPDNLLIPQLNFL 
LSATVKEQlVKQSTHLVCRAYGEVYAAVMNPXNBYKDPKNri^Il 

SPQQVQTLLS 



FFYYTFYBEnCTPSRKHVABKETMUiKCPDKMPKRTKLIiBQQPL 

pvhophslvsbgftvkammkhswbgppaab^pkbrptkptafr 

KFYBRC®FPIALBHDSKra?XXASfKVKlKKIiDYHHYlJPIiFFtX3LC 
BHTFPYBFPARQGXHDJgiBHGGM KZLPVLPQLI I PI KHA L HL B N 

HQVICVTLKVLQHLVVSAEMVGKALVPYYRQII*PVI2JIPKNMNV 

nsgdgidysqqkrkntgdliqstlbafkryggkhafini kywp 
TYBSCUUJ 















I ESE23E3J! 







NLLDYLPBILDGLPQILGDKGKKIRKKCBVVLGBFLKBIKKNPS 

svkpaemahilvihcqttmliqltamchmrbfiql^vmi^ 
SSGILTAVLPCLAYDDRKKS IKBVANVQIQSLMKLVTPBDDBLD 

BLRPGQRQABPTPDnALPKQSGXASGHWTPSLHLTSCRGPRKPD 
VIGVALGPHLSKQDYFMYVTHTTVAATQRSGSSGSPPFCRQI)TG 
KLSTMATHSQLVKTGTGLKPRQAVSS SH 



TRTPSBABAGGBSPQSCVSAAHSDHTAGKPVSLLAPLIPPReAG 

QPLTFSPSGRQPLRSLLVGMCSGSGRRRSSIiSPTMRPGTGABRG 

GUWGHra^AP«QfflPMGQRANMPPVPHG»!PQWMPPI«5GPP 

MGQMPGMHSSVPtPGHHMSHMSQASj^PALPPGflroSMPVAACrCftS 






SS PLHVAVVCSSNQEiRSHRAHN 1LSKR 
GFSVRSFGTGTHVJOjpGPAPDBCWJVYDFKTTYDQMYHDLLaKDK 
BLYTQKGI1 *HMIiDHH1CRIKPRPERFQNCKSZiPDL11jTCRBRVYD 
QVVKDUtSREQETCQPVHVVNVDIC2DWHBBATLGAFlj ICBLCQC 
IQ3TEDMHNBI1JK1X<GBFKBKS(31TFLHTVCFY 



AARARPGVLCCRIiLGAALQDQSRVBMSYlFGQPVTAVVQRVBXH 

KLRQGEHLILGP3rGGGIDQDPSQHPFSBDK3T)KDIYVTRV5BG 

GPABIAGLQIGDJCTMQfVKGfnJMINVTHDQARKRbTKRSEEVVKL 

LVTRQSLQKAVQQSKLS 



HBPAS KGRAEVGHMRLS VAAAISHGRVFRKMGLGPBSRXhLLRN 
IXTQLVRHKRIBAP«ARVDB«GYABKLirYaKI/2]TI*KHAHRM 
ADFWLTKKDLI PKLPQVIiAPRYKDQTGGYTRMLQI PNRSU)HAK 
MAVIBYRIRICLPPIiPLPRROSHLTIiIJTQLLQGLRQIHjROSQBAS 
NH3SHTAQTPGI 



EGIQMLSVQPDTKPKBCAGCNRKIKDRYIiLXAIJOKYWHEDCLKC 
ACOKIRIilBWjSTLYTKAHLILCRRDYIiRLPGVTGNCAACSKLX 
PAyKMVMRAKD>T\rTHlJ3CPACQLCK0RPCVGDKPFLKNN!i I I£Q 
TDYBEGU5KEGYAPQVR 



APNSBSGTQGPLPTPANLFVfrRRANPDPTTSMSATDRWGPKAVP 
GLRLATiLT JJ GT/3TPKSGVQGQEt5LDPPBYDGVDRVIWVMAKNY 
KNVFKKYHVLALLYHBPPKDDKASQROFEMKBLrLBXtAAQVLED 
KEVGPGLVDSBKDAAVAKKLGLTBVDSMYVFKGDBVXEYDGKFS 
ADTIVHFLIiDVLKDPVKLIEGKRRLQAFSNIBDKIKUGYPKSK 
DSRHYXAFBQAAEEPHPYIPPFATFDSKGAXKLTLXLNBIDPYB 
AFMBBPVTI PDKPKSBEBXVNFVBRHSRSTLRKLKPBSMYEniB 
DDMDQIHIVAgABBADroQPBFIigII»KAVACfD W TEWPDL3I IWI 
DPDDrPLLVPYHKKTPDIDLSAPQIGVVNVTnADHLWMEMDDBB 



NRCDRGGC5eQAHR0ftGQGCRTQGASPGFGPGHSPFSQOTMKAFH 
TTCVVLLVF GSVS KAKPPPFKDRKDIVBYPDMDgABFBDVWBDS 
VTKSPQRVI ITRimKDSTTVKLBGQDKBfQSGDPEDADTOEGDTK 
SKPYDDKKFEGYBDKPOTSSS KNKDPXTXVDVPAHLQNSWBS YY 
LgTLHVYGLLAYIifflYI iqKHI E MS Bl«ftQftWgWrHRELLESMPTL 
VGDDGTHXEATSTCKI^ENKHIYNIiHCSGHVCCBfflliaQI&PL 
KRQDLLHVLARKHRPV3DQVQ X KVTMNDEDMimVFAVGTRK^L 
VRI/2KEMQDLSEFCSDKPKSGAKYGLPDSLAXLSHtGBVTDCam 
DTKMVHFLTHYADBCIKSVHPSDQPSGPKIHQKHGOPLICLPirrKR 
tllltphvpgsgbttypkiwballplmnmviysidkakkfrlhrr 

GIQKADKNRARVBEHPLKLTHVQRQEAAOSRBKBKKRAEKHRIM 

KEBDPEEQRRLEBAALRREQK21£KKQMXMXQ1KVEAM 



KACAHTLSCPAFARLGRARRRPKMSHRTSSTFRAKRSFrlSSSSS 
SSSSTSSSASRALPAQdPH 4EKAL3MP3DD?GSPMRPHSBPLAF 
PARPGGAQCt KTLGDAYBEAVDVIUDFS PBDI IVTTSNNH I BVRA 
BXLAADGTVM?3NFAHKDQLPKDVDPTSVTSAIiRKDGSLTIRARR 
HPHTKHVQQTFRTHIRX 



KHAVLSAPGLRjCIPRILGLJISSVGPAVQARGVHOSVATDGPSSTQ 

PALPRARAVAPKPSSRGBYVVAKLDDLVHWARRSSLRPHTPGLA 

CCAVEMMHMAAPHYIMDRPGVVFRASPRQSDVJfn/AGTLTNKMA 



PALRKVYCQMPSPRYWSM3SCAHGGG«HySYSVVHGCDRIVP 

VDIYlPGCPPTAEALLYGIbQLQRKIKRBRRLQUgYRg 



lOv-Vi :*; v ! f gjgU ci;.' r ; :l t'l iyiTj 1 



KPLEGVKIXiDLTRVIjflGPFATWNLCTLG AH y XKVKR PGAGtElTB 

TWGPPFVGTKSTYyLSVNRNXKSIAVHIKDPKGVKIIKKLAAVC 



OPAGYnAVASAVSGIMHITGPBVACLSHIAAHYLIGQKEAKRWO 

TAHGSIVPYQA PK TKD OnW GACaSWOQPATVCKILDLpgLXray 

SEnCIMUIiHVBHRKBLZXZZiSBRFBBBLTSKSLnfBGSGVPYG 

PIHKMKMVPAKPQFVLHKGLVMEMKHPT V GKIS VPGP AVRYSKFg 

MSKARPPPLLGQHTTiniiKEVLRYIJDRAIGELLSAGWDQHFrH 



GTAMVSDKDKUttiLVIVVnANPIlfWGKOALKBSQPTLS KCIDAV 
MVI£3fSHI*FMHR£33KLAVZASHI QBSKFLYPGKHQRLGDPPGDP 
<^PBFMPSGSXDtmELLTSAHEVZVBBlKDUmSDIKGQRT 
BTLLAGSLAKALCYXHRHNKUVKLBiQBKKSRILVTKAAEDSALQ 
YNNFMNVI FAAQKQ!JniIDACVLIlS0SGljLQQACZ)lTGGLYLKV’ 
PQHPSXiLQYLLffVFLFDQDQRSOLILPPPVHVDYRAACFCHRNL 
IgIGYVCSVCLSIFCWPSPICTTC B T A FKISLPPVLKAKgKKLK 
VSA 



!WPTATIi2MGyGMASPG LAAPSLTP PQZJOTHLQQFPPQArR 
QSLLGPPPVUVPMHPSQFTTLSCBINPQKQARTS33TTPNHKDSSS 
QTMPVBDKSDPPKGSKEAABPRMDTPBD®3LPPCPEDIAKKK8T 
PAPBPBPCBA3BLPAKRLR3SEBPTSKBPPGQLQVKA0P0ARMT 
VPKQTQTPDLLPKALaAQVIiPRPQPRVLQyQAOVOSQTQPRI PS 
mQ7QPKLQKQA0TaXSPEHLVLQQKQVQPQLQQ3ABP0KQVQ 

pqvopoaksqgprovqlqqbaeplkqvqpgfvqpoahsqpprqvq 

IX3LQKQVQTQTYPQVHT0AQPSVQPQEHPPAQVSVQPPKQTHEQ 
PHTQ PQ VS1AAPBQTPVVVHV CGLKMP P DAVRAGGGMKKT1.PE P 
VGTQVSHEBJQHBSAOGLDVGBCENRARBMPGVTOAGGSIiKVTI 

ix^dshafstvpltpvprpsdsvsstpaatstpskoalqpfcy 

ICRASCSSQQB#QDHMSBPQBCX2RLGJSlQHMSOACltLSIiLPVPR 
DVLBTKDRHPPPRRHCNTCQLYYHCSJLIQHRRTQDHKXARQSLR 
PFCTVCKRY Y KTPKXWKU VKSQGKKDKAKBZtKSLBKBlAGQDB 
DHFITVDAVGCFBGD 8BBBBDDBDBSB IBVKBHLCKQVRSRDIS 
RKENKGSBTySPHTAXGVDFLVPVXGYXCSlCBKFTHSNSGAQL 
SHCKSI/JHPKNLQKYKAAKNPSPTTRPVSRRCAINAHNALTAIjF 
TSSGRPPSQPNTQI>KTPSKVTARPSQPPLPRRSTHI*KT 



PRVHTKLI LLVNDKKRYERVGGGPKRLOR0V»IEEZUB0n0BKV 
RBLEKQNDTLKNRLISAKQQIiQTQGYRQTPTfNNVOSRIl?TGRRK 
ANKNAGLQECPEKG IKPQUADVABTPHPtlfc' 1'JUWiSliLEEARGR 
IRNLBIJVIQSQRGQIKKLKHLABXUrraLRRKEHBIKLSIiLQLR 
BQQATDQkSWIBPMVBMIKLHKOLVBKSaALSAliBGaCFXOLOBK 
QRTUQSnEtAlMUfGDBLIilMQLKBQBLKCCSLEeCQLHSHKFSBR 
RI RELQDR IMDLBKSRBLLKBSIYDKLYDSAFSAAHBBQHXLEBQ 
QXJCVQXAQLBTALESOLTDRTBXLDRLKTERDG^BKLVQEKRBL 
QLQYLEQKQQLDBLKRRIXLYHQBlTDZMAraLSEALIalArRAOKE 
QKNGDLS FLVKVDSB1KKDLEBSMRBLQA1BABTVQBL3KTRNM 
LIKQHKIIJKDYGHKVKAVTTIKMKBIjQQDYKLKVEQYVHLLDIRA 
AK IHKLKAOLKDIA YGTKQYK FKPBIMPDDSVDEPPETX HLBRG 
EHLF8 IHIHK\nFSSBVLQASGDKBPVTPCTYAFYDFBrjQrrTPV 
VJUa^HPEYNFTSOYLVHVNDLFLQYIQKHTITliBVHQAYSTBYS 
TIAAOQLKPHKILBKSGRIFCTASLICJrHSDIMPGTVBYWFRL 

rv?mdoairlyrbrakalctitshfkgpbhmqsi»sqqapkxaol 

S3TDSTDGHLHKMITIRCC3JHLQSRASHLQPHPYVVT1CFFDPA 
DHBTAI I PSSHDPQFOTHMYFPVPMNMDLDRYLKSBSLS FYVFD 

dsptqbwtttgkvhvplisiahdrcisgipsltcmqhhpagtxh 








ECAITCSMSHGPYDSNQPHRKTKlTFKEDKVDSTIilGSSSHVBW 
KnAVHUPKNRSDDEEKKKKGPVSPRNLQBSEBBBVPQKSWDEG 
TSTLSIPFEMLASYKSTSSTFHSLSBQQfVCMAVDIGRHRNDQfVK 
HEDHHATGPRL3HELLDEKDPBVLQDSLDBCY STPSG CLBLTDS 
CQpyRSAPVVLBQQHVGLAVNMDBrBKYiQEVBBDQDPS CPRLSR 
KT>U)KKKPHVLQD3LGRCY9TPSGYIiBIiPDLGQPrSSAVYSLEB 
QYLGIiAIJJVDaXgKDOBBSEDQGPPCPHTvSRKLLBVVHPEVLQD 
SIDRCYSTPSSCLEQPDSCXiPyGSSFyALBBKHVGPSLDVGBIB 
KjUjlUi^JtBRGBRSKMgRBRGRKEGggDQMPPPPRLSngTJJRKn 
PBVIjQDSLDRCYSTPSGCLKKTDSOQPYHSAPYTLBQQRVGIAV 

dmdki exyqevebdqdpscprlsgbijldbkhprvlqbsldrcys 

TPSGCLKLTDS CQPYRSAFYXUSQQRVGLAVDMDg IEKYQBVKB 
DQDPS CPHLSRBLLDBKBPBVLQDS IiGRCYSTPSGYIiKLPDLOQ 
PYS6AVYSLBBQYLaLALDVt3UtKKDQHKEKDQGPPCPRI*SREL 
LBWBPBVLQDSLDRCTSTPSSCLBQPDSCQPYGSSFYALKBIOi 
VGF3LPVGBIBKKGKCKKRRGRRSKKBRRK3UCBGBBDQNPPCP 
RLSSMLMBVEEPKVLODSLDICYSTPSMYPKLPDSPQHYltSVFY 
SPKEEHISPALYVDNRFPTLTVTSLHbVFC^OVIFPQ 


6349 


3 


3^79 

| 


AGAkKCFVTLIACFIjAKQQHK/KYBECKDLIK3MIjRNEbQPTffiB 

KLAEQLKOABBLRQYKVIjVHSQERELTQIiRBKLREGRDASRSLN 

HnjQAIiI/rpDEPDKSQGQDLQBQLAEGCRXiAQHLVDKLSPBNEW 

dddkdvqvbvabkvqksssprbmqkrekkbvpbdslkbcaitcs 

n^spcdsnqfhknikitfkbdbvnstlvvdrbsshdbcqiain 

HiPVPGPTSSATHVSMWSAGPLSGKKAAIH I X#B INEKXiRPQLA 
KKKQQFRNLKKKCFLTOIACFLANQQNKYKYEECICDLIKFMLRN 
BRQFKBggLABQLHQftBBMtQYKVLVHSOBRELTQLRBKIiREGR 
DASRSLHEHLQAIiLTPDEPDKSQGODLQBQIjABGCRXiAQHIiVQK 

lspbndndddbdvqvbvaekvqkssaprbmpkabbrevpbdslb 

BCAITCSNSBQPYDSNQPHRKTKITFEEDKVDSTbIGSSSHVBW 
KDAVHI I PENBSDDKESESKGPVBPRNLQSSKKKEVPQKSHDKG 
YSTLSIPPBMIASYKSYSSTPHSLBBQQVCMAVDIGRHRWDQVK 
. KKI2HKATGPRI>SRKIJJ3 KKGPEVIiQI)SLDKCYSTPSGCLEX,TIXS 
OQPYRSAFYVLBQQKVGSLAVNMDBIBKSQBVEKDQDPSCPttLSR 
KLLDBKBPBVLQDSLGRCYSTPSGYLBLPDLGQPYSSAVYSLKE 
QYLGIALDVDRIKKDQBBBEDQGPPCPaLSHBLLBVVBPBVLQD 
SLDRCYSTPSSCLBQPOSOQPYGS3F2AUSBKHVGPSLDVGBIK 
KRGKGgKRRGRBaZKBRRRGaftKBOSHDQgPPCPRLSRgLLDBPl? 
PBVLQDSI^CYSTPSGCLBIiTDSCOPYSSAFYILEQQRVGLAV 
IWE«XBKYQBVEKDaDPSCPRI*SGHLI*nBKBPKVLQBSU>RCYS 
TPSGCLELTDSGQFYR9AFYXI*EQQBVGLAVZU4DBIBKYQBVEB 
DQDPSCPRLSRBXiLDKKKPBVLQDSLOiCYSTPSGYLBLPDLGQ 
PYSSAVYSLKEQYL3LRLDVDR1KKDQBBBEDQGPPCPRLSRKL 
IjBWHPBVIjQDSIJJRCYBTPSSCI^QPDSOQPYGSSPYAIjKKKH 
VGFSLDVGEIRKKGKBKKRRGRRSKKKRRRGRKEGBKDQNPPCP 
RI^SMIrfffiVBBPBVLQDSLDICYSTPSMYFELPDSPQHYRSVFY 
SPBKBHIS PALYVEWHFFTLTVTSLHLVF©<GVI FPQ 


6350 


3 


— 3679 


^SgaEKCFVTIjLACFIJ\KQCR?KYKYBECKDLI KSMLRNBIiQFKKB 
ELAEQLEOABBLBQYKVLVHSQBRBLTQLBBKLRBG&DASRSUf 
EHLQALLTPDBPDKSQGQDbQEQLABGCBlAQHLVQKLSPENDK 
dddbdvqvrvrhkvoksssprbmqkaebkbvpedslbecaztcs 

NSHGPCDSNQPHKNXKITFKEDEVNSTLWDRBSSHDECQDAI^ 

ILPVEK5PTSSA3NVSMWSftGPI*SGEKAAINILBIKKXLRPQLA 

KKK^PRKLKKKCPLTOLACFIAHQQKKYKYKBCKDLIKPMLRN 

ERQFKBBKLABQLKQAKELHQYKVIjVHSQHRBLTQIjRBKLREGR 

DASRSLNHILQALLTPDBPDKSQGQDUJEQLAEGCRLAQHLVQK 

LSPENDNraDKDVQVEVAKKVQKSSAPREMPKAKKKEVPBDSLB 



KCAITCSNSHGPYDSNQPHRKTKITFEKDKVDSTLIGSSSHVHH 
EDAVH X 1PKN K S D D KKEgBKG PVSPRHIiQRS BgKgVPQKSWPBG 
YSTLS I PPBWASYES YSSTFHSLRSQQVCMAVDICHHHWDQVK 
KKDHKATOPRLSRSLItDRKGPBVLQpSLP&C^ayrPSGCIiBLTDS 
CQPYRSAPYVLBQQPVGLAVJSMDBIEKYQBVEKDQDPSCPRLSR 

elldbkepevlqdslgrcystpsgylklpdlgqpyssavysleb 
QYLG1ALDVDRI KKDQKBEBDQGPPCPRLSRKIjLBVVEPEVLQD 

S LDRCY3TPSSa#EQPDSOQPTOS3FYALKKKHVGPS UJVGBIH 
BGKg DqSPPC PHLSRHLLD EKG 
P KVbQBSliDSCYSTPSGCLBLTnSCQPYRSAFYlLKQQRVG LAV 
MOBIBKYQKVBBDQDPSCPItLSGRLUJBKEPBVLQESLDHCYS 
TPSGCLELTDSCQPfHSAFYILBQQAVGXAVDMnBlEKYQBVBE 
DQD PSCPRLSRBLIJDHKKPBVIjQDSIjGRCYSTPSGYIiBLPDLGQ 
P YSSAVYSLBEQY03LALDVDH X KKDQBKKKDQGPPCPRIiSRKL 
LBVVKPBVLQnaLDRCYSTPSSOiKQPDSCQFTGSSFYALKBKH 



k V m J.'j p ii'.n :« a ;< .1 ,tci a :Ts cTnTXm < ;«:< ; r J a L*; CJ U ‘J 



RLHSWLMBVBBPEVLQDSIJ3ICYSTPSMYPELPDSPQHTPSVPY 

SFBBKHISFALYVDNfiFPTLTVTSLHLVFOMGVIPPQ 






RTVQALPHGPRQMSBhgi»Pl<mfPBBARI«LABIQAVTXiVSAPRP 

OSKHHSLALTSFRRQQBBSFQBOSALAABARBTRRQHLIfBKITB 

goaakkqklbqasgasssqeagssoaakkdetsdgqasgbqkba 

GPSSSQAGPSNGVAPLPHSAHiVQLATARPRPVKARPIkDWRVQS 
KDWPHAGRPAHELRYSIYRDLWBRGFFLSAAGKPGGDFLVYPGD 
PLRPHAHYIAQCWAPBim PLQDLVAAGRLGTS VRKTTiLLCS PQ 
PDGKWYTSLQWASLQ 



WSBWI^PCHAAKXZKGLSMLRlTMICniAIsLAAhATEFVQGRSAP 
AMARSLVmmrFrajSVYQVKISPTPQOSAASSAEtarVGQGAPG 
UWMMNPEGGVHHEW3CmDGC»lXPEGGGGKQBPRQQPQPPPEE 
PAQAAMBC^PBNMQPHTRRTKFTLIjQVEKLBSVPRHTQY PDVP 
TRRELAKHI/TVTTOKVRVWPKHKfiARCRRHQREIJ^LANELRADP 
DDCVYIWD 



RPAGAGAIPEARARPPDVQAABEBKEHDLPDSASRVFCOEXLSM 
VNTDDVHAI nMKNMU3RPBKTHBMLLHFNNL3SARIiQQMSER 
FIH3TRTLVEMKRDIJD6 IFRRXRTIiK33KLARQKPBAPSHTPBAS 
PLBBEDBDPIPPSTTTTXATSEQSTGS CEXrSPDTVSPSLS PGFB 

dlshvqpgspadsghsqtddeemtge 




RGSSAAFRGSGIfiGAMIRHVl*WlGMGRGLLTRRPGTRRGGPSLD 
NDGKVSBIKKR1ES ILPGRSCTJIjIKJDTSHLPPEHSDVVTVGGGV 

lglsvaywlkkleshrgairvlvverijhtysqastglsvggi CQ 

QFSLPBHI CJLSLPSASPLRNINBYIAVVDAPPtOLR.FHPSGYLIj 
LASEKnAAAMESHVKVQPQEGAHVSLMSPDQLRHXPPWINTEGV 
AI>ASYGMKDEGWIT)PWCLLQGIJIRKVQSIjGVLFCQGEVTRFVS S 
SOHMLTTDDKAWLKRIHBVHVKKDRSLByQPVBCAIVINAAGA 
BSAQIAALAGVGBGPPGTLQGrKLPVBPRXRyVYVRHCPQGPGL 
ETPLV7U5TSQVYFRSKGLGSHYLGGRS PTBQEBPDPANLKVDHD 

PPQDKVWPHIiALRVPAFETLKVQSAWAGYYDYNTFDQNGVVG PH 
PLVVNMYFATGFSGHGLCXJAPGlGRAVABMVLKGaPQTXDLSPP 
LPTRFYLGBKIQHHNXI 



TGLTSSCLPLQVWHTKRTKDMGKF3SVTVSTIDERBBEIBAREV 
ADSY7^JNAKVIKKQIiERKr243 KRRLQBIABLSAKKAKMKGTLID 
KQPK 


Hj 


2 


915 


GLLHNMALLVRVLRNQTS I SQWVPVCSRLI PVSPTQGQGDRALS 
RTSCHPWISQSQACGGSEQXPGIDIQmRKtHTTRKLSTTKDS P 
QPVBBIWxftFTXIIHAMGFTGPIJCYSEWKIKIAALHMYTSCVBK 
TDFBRPFLRC054PI7XFHS?fFLITLLHVW<CLVRMKQEGHSGKYM 
CRI IVHFIWHDVQ0RGKV>CVHPYILKKH)II3JITNHPYAAXLGY 
DBGILSOTKGlAAALHRTPFHRKCEDPHHLELIiVBYVRICQ iqyl 
DSHHGBDLLLTGEV&WRPLVBKHPQSILKPBSPTYHDBGL 


6358 


■ 


1040 


ASDALH3XjSAPVLRLSSRSA)\RPATHTBQAISF 7VKDP1AGG1AA “ 
AISKTAVAPIKRVKLIJjQVGWASKQIAADKOYKGrVDCIVRIPK 
EQGVLSFWROHiANVIRYFPTQAliHFAPKDKYKQI PLGGVDKHT 
QFHRYFAGHLASGGAAGATSLCFVYPLDPABTRLAA0VGKSGT8 
RB FRGLGDCLVKITKSDGI RGLYQGFSV5VQGI IIYRAAYFGVY 
DIAKGMLPDPKNTHIVVS KHIAQTVTAVAGVVSYPPDTVRRRJW 
MQSGRKGADIHYTt3TVDCHRKIPRDEGGKAFFRGAW3NVLRG>IG 
GAFVLVLYDKLKKVT 


6359 


98 


1086 


VCSQKB^CMKEDCLPSS HVPXSDS S3 IQXSELLGIkLECTYNCYHB 
GKSFQLRHRKRBGTLI IEGLI231ARGLRRFlRliGHQDDR£GVHL 
PSTSSHPRRPSCPIJCBPSPQNGNITAQGPSIQPVHKABSSTnSS 
QPLEEABKAPQLHRTKS DASCMSQHRPBCDAPGBAQH I RRHRFS 
XBGHFYNHKTSVFrPAYGSVTNVRVNSlKTTLQVLTLLIJTKPRV 
EDGPSEPALYIVHRSGBRTKUCDCBYPLISRILHGPCBRXARIP 
IKBADLGVHVPHBV7V0YI KFEMPVLDSFVBKLKEBBKREI IKLT 
MKPOALRLTMLQRLEQLVEAK 


6360 


1 


mm 


GTRGAVPSTLEKWLP PR S CR VPW IHSGTTKS KVS PKI TliTSDP 
RLPYKVLSVPSSTPFTAVLKFAAKBFKVPAATSAIITbrDGIGIH 
PAQTOGNVFLKHGSKLRI 1 PRDRVGSC 


6361 


615 


158 


RPGLGQLQHCALAPQAjGyjnblCRFHGRIWA^ 

PKDTLWTPLPDSS PVAVPLGAP IAVASTLSVEHNDGVBTGI WAC 
APGHHRHQITSOBFCHPIQGRCrFTPDDGBTLHIQAGnAlrtLPA 
KSTOIHDIQBTVRKTYVLIL 


6362 


350 


■ 


TTMIXJSHSAALKLQQI*PPTSSSSAVSEASPSYKENI*IGALLAI p 
GHLVV 5IAI1CLQKYCHIRLAGSKDPRAY FKTETWWLGXiPUILLG 
BLGVPASYAFAPL3LIVPLSAVSVIASAI IGIIFI3CBKWKPKDF 
LRRYVLSPVGOGIAWGTYLLVTPAPNSHBKWTGEHVTOHLVSO? 
PFIXYMLVBlILPCUtLYFYKBKKABNIVVIXJiLVALlJQSIVTVV 
tVKAVAGMLVLS IQGNIX>IJ>YTIF?VMP7CHVA!IAVYQAAPIi6Q 
ASffilYDSSUASVGYILSTTIArrAGAIFYLDPIGBDyifilOlP 
ALGCLIAFLGVFLITRNRKK?IPFEPYTSMIlAMP<3fQSiMHDXEM 
TVQPRTJCASPSYGAI^HNDWISEriAPATLPVHQKKHGSRSASG 
VPYKVLKHTKKB 


I 


21 


1201 


RRIEUjGSSFPRRRDSSAMESYITVIAHQPVVIDHGSGVXKAGPAG 
DQIPKYCPPNYVORPKHVRVMAGALEGDIPIGPHAHSHRGLLS I 
R YPHKBG IVKDWNDHHRI WQ YVYSKDQLQTPSBEHPVIjLTBAPL 
NPRKlteERAAEVFFBTFNVPALPISMQAVLSLYATtSRTTGVVLD 
SGDGVTHAVPIYEGPAMPHS IMRZD1AGR0VSRFLRLYLRKEQY 
DPHSSSBFEIVKAI KERACTLS INPQKDETLBTBKAQYYLPDGS 
TIBIGPSRPRAPBLLFRPDLIGKBSBGIHBVLVPAIQKSDMDLR 

rtlfshxvlsggstlprgfgdrulsbvkklaprdvkirxsapqb 

RLYSTMIGGSIIJtSLDTPKKMHVSKKEYKBDGARSIHRKTP 




21 


1201 


RRTRLO35PPRRRDS3AMBSYDVIAHQPVVltaiGSGVXKAGPA0 
DQIPKYCPPNYVGRPKHVRVMASiLBGDIFICTHABEHRGIjLS 1 
RYPfrEHGIVKDWNDMERIWOYVYSKDQLQTFSEBHPVX^TEAPL 
HPRKNRERAABWFFHTFNVPALFISI'Qft.VLSLYA'PGKTTGWLD 
SGDGVTBAVPIYBGFAMPHS IMRIDIAGRDVSRFLRLYLRKEGY 
DFHSSSKFE IVKAIKERACYLS INPQKDBTLSTEKAQYYLPLGS 
TISIGPSRPRAPBLLPRPDLIGBESBGIHEVLVFAIQKSDKDI^ 
RTLFSNIVL3G03TI(FKGFG0£IiLSBVKXLAPKDVKXRISAPQE 
RLYSTW IGGS IIASLDTFKKMHVSKKBYKKDGARSI31RKTF 


6365 


234 


19B9 


■ESKSRASSAHA^ScPSK^BVHSRPR^ESSISSNSG^OTr" 
ASWGTIiAPDgYGHPPLlIKDQlIBgSRlJ*3Lg?kLgSHIKAAKftVR 
MTKRTCLGPH<nj>KKKVDRIX3DVTW7nX3ATII*3»MDVIfflQIAK 
LMVBLSKBODDB I(93GTIGVVVIAGAI«LBBABQLIiDHGlHPXIlX 
ADGYBQAAHVAI BHEiDRISDSVLVDIKDrBPLICnAICrriiGSICV 
WSC3IRQMABXAVHAVI.TVADKBRRDVDFELIXVEGKVGGRLED 
tki»irgvtvdkdfshpqmpkkvhdakiailtcppeppkpktkhk 
LDVTSVEDYKADQKYKKEKFEBMIQQIKBTC 3U9I«AI0QHGFDDS 
ANHI«IiLQNHLPAVRHVGGPBIHLIAIAIt3GRJVPRPSHLTABKL 
GPAGL\q^SFtnTKDKMI»VXBQ(TOSRAVTIPrRGGmWIIEB 
AKRSLHDTVIiCVIRHIJHDHRWYGGaAABISCALAVSQKADKCP 

tlkoyamhapaiiai^bvipmai^sbnsgmhpiqthtevharqvkkm 

NPMiGIDCLHKin'OTDMIDQ^VTBTIdGiaCQQISIiftTQMVHMIIiK 

IDDIRKPGESBB 


6366 


257 


1B9B 


GMKSGAHSSTFWVLLS 1 KbGAVAMLCKEQGITVLGfJftVPOIIiV 
IGKFNVIiBIVQECVIjHKDKSLBNL(9I1iRHGGXLFRI 1TLLT6GGAG 
MLYVrooafOTGPPAPTHVDNPT^PMJSMLVHAVHYNYYYSLHA 
WLLI^PWnLCFDWSMQCZPLI KSISDHRVIAIiAALWFCUGLIC 
QALCSKDGHKRRIL»TLGLGPIjVXPF1jPASNLFFHVGFW7VKRVI, 
YLPSVGYCVLLTFCJPQM*3ICHTKKKKLIAAVVIjGII*PINTLRCV 
LRSGBHRSKBQI*FRSALSVCPIilWCVHYKIGKNIiADKGNQTAAI 
RYYREAVRLNPKYVHAMNNIiGNILKBRNBIjOBAEKLLS LAVQIQ 
PDPJUUVNKE9LGrVQN5LKRFBAAEOSYRTAIKHRRKYPDCYY2JL 
GRLYADLNRHVDAI JJAWRNATVI,KPKHSLAWNNMI I EJLDNTGNL 
AQAEAVGREALBLIPZ3DH5U4FSLANV1/3K5QKYKESEALPLKA. 
IKANPN7vASyHGNlAVLYUHWGHLDLAKK}IYEISLOU)PT7VSaT 
KBNYGLliRRKLHUfQKKAV 


6367 


287 


1934 


3 IGPP VMLVI^ IIAYTCEMFQDSVAFE0VAVS PTQKJKHALIiDPS 
QKMLYRDVMQ5BTFXNLT9VOKTWKVCSfIKDBYKNPRIlNLSLMRB 
KLCBSKSSUHOGBSFHQIADDMUJHBTLPGITPCBSSVOGEVGT 
GHSSLBTHIBMWT5HKSSHYQBYGKHPYKWKBCKKRPSYIJ3SPQ 
SHDKJU2TKBKPYDGKBCTKTPISHSCIQRHJIVMHSGDGPYKCKP 
05KAFYFUIliCIilHBRIB3OVKPYKC8QGG3AF IRS 1 IbPVHBH 
THIOVKADBCKBCXaiAFSPPSK IKHHKRSHTGEKP YBCKQOOKV 
PISPSS IQYHKMIHTGKKFYBCKQCX3KAFRCGSHLQKHIIRTHTG 
BKPYECHQCOKAFRCTSDLQRHBKIHTBDKPYGCEQCGKGFRCA 
SQLQIHBirrHSGBKPlffiCKBOGKVPKXPSSLRIHBRTHTGBKPH 
BCKQCGKAFRYFSSLHlUKR'iTiliaJKP YKCXVOGKAFTCSSSIR 
YUBRTirrGBKPYBCXHCGRAPISNYIHlfBBRTRTGBKPYDCRQC 
GKAFIRASSCRBHKJtTHTIHH 


636B 




327 


I^VPAKIJ3PR6MPRTAGALPLRPPPLTMAVPHDHVBI KDPQYDB 
DSSTYFYPCPOQDHPSITKBDloBNGEDVATCPSCSLIIKVIYDK 
BQ FVCG ETVPAPSANXELVKC 


6369 


1 


1745 

L 


AGCCROTRFPTPIBGPGSliCHNyCESARCTVnrRTIHGSPREDTG'F^ 

PRSREMWFQDSWVFKDVAVSFTQKBMALLDPSQKNLYRDVMQKT 

ITCNI>TSVGWPWKWQHI BDBYKHPRSNLSLMSBKLCBSKBSHH06 

SSPNQIADIMLNRKTLPGITPCBSSVaSKVGTCSSSLHTHIRAD 

TGUKSSBYQBYGHNPYRNKBCK3CAPSYIiDSFQSHDK3\CTKBKFy 

DGKBCTBTPISH3CIQRHRVMHSGDGPYKCKFCGKAFYFLNIjC3» 

IHKR1HTGVKPYKCKQOOKAPTHSTTLPVHBRTHTGVHADECKB 

CGNAFSFPSBXRRHKESHTGBKPYKCKQCGKVPISFSSIQYHKM 

THTGEKPYECKOC3GKAPRCX5SHIjQKHBRTHrcKKP r YBCRQCGKA 

FRCTSDLQRHEICIHTKDKPTCCBQOGKGFRCASOIXJIHKRTHSG 

RKPHECXRCGKVFKYFS£LRIHKRTHTCBKPHBCKQCGKAPRYP 




MBsi 








SSimHBRTHTCDKPYKQCVOQKAPTCSSSIRYHERTHTCBKPY 
HCKHCGKAF ISNYIRYHKRTHTGKKPYQCKQOGKAF 1 SAKS CRE 
HKRTHTIN3 


1 


1711 


329 


FVLSgQRUtTKRTKPHgPQLQRGJWWUWGAgEAGftfTTiI.RT.IiriGCG 
iUjVGOLRPVlMTTPAHACNASKTHKLSLYBLHRTPQBAIMDGTB 
IAV3PHSLHSgmCP10iPM«HHrtHTTKBCmRFCSDCIVTALR 
SCaSKBCPTCKKKLVSKSSUSPDPZSFimLZSriypSRBBTKAHQD 
HVldRbSRLHJJQQAliSSSXHKGlJBMQAI^HBAQRVRRPlPOSDCrr 
TTMSGGEGB PGKGECDGSDVSSDSAPDSAPGPAPKEPBGGGAGG 
SSVCTt3GGGTGGVGGCa6SBDfiGDIlGGTLGGGTI/JPP8PPQAP3 
P PBPGCTIKLVPBPHPXiVEKGBYCXrrRyVHTTGNATVDHLSKY 
LAIxHIALBRRCXjaBAGBPGGPGGGASDrTGGPDGCGGEGGGAGGG 
DGPK2PALP$LBGV3KKQYT1 T lAPGGG»PTTO^LTL8I»VaB 
KPWKVSRPLSbCTAPTKDPK 


6371 


3 


288 


GVWOfSTAMNFtTTKSFOP^PDKBSFWjDHiJGKClCSFHBK^MKE - 

LHHNITFKNALCRKBSKEYLBCEMBRZLNLQBPLKKIjGPGPIjTSG 

KSBAXX 


6372 


2141 


62f> 


HVSJUTISBGBCABIKYKKIiKDIJljBJCSF^LVKMPSIjQFVVMCVMKH 
LPKVPEKKI>HiVMADKKLYRACAVEVRRQIKQDHQALFGDKV5P 
LIiKQTXLKKKSALFSTKLSVLHNFFS PSPKTRRQGEWQItLTRM 
VGDfTIOjYDWVLQFI^TltPTiRTRMVHYCTLRAKT J .HSLHPLDVG 
El CT\TOPCHlFrVClJlACIRFaFVDS KRASHliQGFLDGVKKGQE 
QVXGDLSMIIXDPFAXHTUMjSTVHHLQKLVGCKTLPEDS pdli> 
LLLRLLALGQQAMDKIDSQVFKBPKMEVKLITRFLPMLMSPLVD 
DYT PNVDQKLP ABB KAPVS YPNTLPES FTKFIjQEQRMACSVOLY 
YVLHITKQRNKNALIJtt»IiPGLVBTPGDLAFODI FLHLLTGNLAL 
LADBPALHDFCSSLFDGPFLTASPRKENVHRHALJUjLIKUHPRV 
APSKLEAXjQKALBPTGQSGEAVKKIjYSQLQEKLEQI^DHRKPSPA 
QAAKTPALKLPLPSVPAPAPb 


6373 


67 


711 


PSRAARASPARLPAWViw I ISHLVVblFGTLYPAYYSYKAVKEK 
DIKHYVTCHMMYlfllPALPTTABXFTDIPLCHFPFYYKLKIAFVA 
HLLSPYTRGSSl«I*YRKPVHPTIkSSKBKBII}DCLVQARDK5Yl>AI* 
VHPGKRtMVAATAAVHAASKOQQALSERljRSPSMQDIffTXROD 
GAPAPSGPPPPOSORASGKHGQPKHSESA3BSASS5GXA 


6374 


535 


2105 


IOa*FC5YI5T3EFPS37RBa5CPlBTFQnrrSSTlPLSSTRJ}H5 






I 


HSCPTHTSCWYTSSTXPI»S5TRDHSCPTHrFCMYMtPlXBI«SS C 
CPAHLQTBGSNGKKSVtSGFQWLBDTVLPPEGGGQPDDSGT rw 
DISVLRVTRRGBOM5HFPQTPLDPGSOVLVRVDWERRPDHMQQH 
SGQHLI IAVAKn*FKLJCTTSWBLGKPRSAlHLDTP3MTAEQ\rAA 
I BQ3VNHKIRDR1 jPVNVRBLS1jDDPEVEQVSGROIiPDD}IAGP ir 
WBXBOVD 8lOtCO6THVSHl»SDLOVlKrLGTBI0GKIOiHTMLlFL 
St2CWliKKHHR3EGTHKALTAIJjKQG5fVBDHVBAVKICT<INSTKTL 
QKNNUniLRDLAVHXAHSIiRKSPBWGGVVrijHRKEGDSEFMNII 
ANBIOSB3TLLPLTVI33iEKG6GLPLLAGPPASVBTI/>PRVABVL 
EGKGflGKKGRPQGXATKMSIUWBAQAIJ^YISTQSAKS 


1 


1 


1535 

i 


AIHAAATRPVRLPBAGCEOaHRCWNPSHSBSHSOBOQLAAHSRT 
CPGBPBRPGQQWRGPTWbVTAYLAPVGlJiASCI jGXiKLSRCRAK 
PPGRACSNPSPl&FOLDFYQVYFLALAADVIjQAPYLYlCtiYQHYY 
PLEGQIAILYVCXJLASTVXPGbVASSIiVDtflJSRKNS CVLFSLTY 
SLCCLTELSQDYFVLLVl39ALG6I»S17VLLFSAFEAtfYXHBHVBR 
HD^ABWTPATPAHAAFWHHVIAVVAGVAAKAVASHIGLGPVAP 
FVAAI PLLALAGAlJiI*RHJraBNYDRQPAPSRTXZASGIiSCLLSDrR 
RVUiI/mQALFBSVIFIFVFLWTPVLDPttSAPLGXIFSSE>!AA 
SLLGSSLYRIATSKRYHLQPHHLLSIiAVLIVVFSLFMLTFSTSP 
GQESPVBSFIAPIdiIKI»ACGbYPPa4SPLRRKVIPBTEQAGVLN 



I 





jjWW 








WFRVPbHSIACIX7lJL.VIiiI^DRKrGTHWI1FSICSAVMVMAliLAV 

VGLPTVVIun>AKI»BVPSPTBBPfAPBL 


i 


380 


1437 




6377 


2311 


mm 


srirrrssrrprbppgpsrrrrrrrpdpktmpsbktpkqrrtfb 

QRVHDVHLI REQHPTK1FVI IKRYKQKKQbPVIDKTKFLVPDHV 
HMSBLIKI iRRIUiQbHANOftPFLLVNCTSMVSVSTPlSKVYBSB 
KDBDGFLYMVYASQSTFQOLSV 


6378 


686 


191 


GAGPHKAFPDGIGKaSRRARI>pgyXRPPGHVGGGDSGRRHMAVA 
DLALIFDVDIDSDGVyKYVLniVHSAPRSGAPAABSKBlVRnYX 
WABYHTVDIYDKVSGDfiQKQGJCDCKCIiGGGH I SHQSQD KKTHVYG 
YSWAYGPAQHAISTBKIKAKyPDYBVTWAKDOY 


6379 


35 


378 


KHAGSPS PSRAALRRCAPQRSQAPRHPI18AACRRS FQGSQGRAY 
LPMSVVNVOOTPAHBRVLbTGLHAVADrYCEHCKl 1 LGWKYKHA 
FESSQKYKBGKYI IBIiAHMXKDHGKD 


1 


1414 


462 


PAVQGQRGAOPPttiRGSGaiMARPALTVVRHGETRPNKKKI IQGQ 
GVDK PLS ETG PKQAAAAG I FlJfflVKPTflAFSS DLMRTKQTMHGI 

lkrskpcxdmtvkydsrlrkrkygvvegkai>sklramakaareb 

CPWTPPGGKTTJXJVKKRGIDFFKKIjOQLIIaKKADQKBQFSQGS 
PSNOLETSXjABI FPLGKHHSSKVHSDSGIPQIiAASVLWSHGAY 

mrslfdyfltdlkcslpatlsrsblksvtpntgmslfi infebg 

RBVKPTVQC I CMNLQDHI2iGLTBNSIiGLNLPS KSNHFKPUCGVP 
IALFTSliLC 


6361 


1668 


216 


avveaqgsrgpsgagwspIzoaaamnfsbvfklssllcxfd pdgk 

YI»ASCVQYHl»WR0tfKTLOIDQbYYCLDQIQHXEWSAD3LFrLC 
AMYXRGLVQVWSIiEQPKHHCKIDEGSAGLVASCHSPlXiRHILHT 
THFHLRITVWSLCTKSVSYIKYPKACLQGITPTR1XJRYMALAER 
RDCKDYVS IFVCSMfffiiLHHPDIDTQDI»TGIB9IAPWGCVLAVWD 
TCLKYKILIjYStDGRIiLSTTSAYBWSbGIKSVAITSPSSQFLAVG 
SYlXJKVRIIJiHVTWKMITBPGHPAAIHDPKIWYKEAEKSPQIjG 
LGCLS FPPPRAGAGPLPSS8SKTEIASVPVBI*QTLIQ?VTDRAWP 
X3GIGMLAFSPDSYFIATRffDNtPHAVWVm>IQKIiRi^VLBQb 
SPVRAFQWDPQQPRIjAICrGGSRliYLWSPAGCHSVQVPGEGDPA 
VZtSIXSTHLSGDSHALLSKDHFCLCFLBTBAVVGTACRQjZiGGBT 






1062 


PSBDBDRNLCDIAYPIiKGIQCGIVDIVDHSDCBPKS KLLRWTTNK~~ 

KHHVLBTB KTPKDIfVRQHRKBSKMKSHJCLBEBFBWIiKTCSBVLYY 

TVBKRGHISSQLKHYNPWSMKCH0QQLQRMKEN7VKHRNQYKFIL 

LKNLTSRYEVPCVbDlxKMGTRQBGDDASKBKAANQriLKCQQSTS 

AVIGVHVCGMQfVYQAGSGQLMPKNKyHGRKLSVQGFKBAIiFQFF 

HNGR YLRRELLG PVLKKLTBLKAVI^RQBSYRPYSSSLLVriDG 

KKRPEWLDSDABDLKDliSEBSADBSAjQAY70fKPIQAS3VDVRM 








IDPAHTTCRLYGBDTWHBGQDAGYIFGLOSLIDrVTBISBBSG 

B 




1061 


spapgrpsphgsopaaraaaapampsakqrgskgghgaaspsek 

GAHPSAARPLAAPTPAAPACRS PSPGGAPAS FPGRAPRSLASQP 

aaraaaapahpsakorgskgghgaaspsbkgahpsgoaddvakk 

PPPAPQQPPPPPAPHPOQHPQQHPQyQAHgKPgffiGGGGGGGKS 
SSSSSASAAAAAAAAS3SAS CS RRI/GBAUfFLPYLALVAAAAFS 

gwcvhhvlbevqqvecrshqdfseqrkelgqglqgvbokvoslqa 

TPGTFB3I LRSSQHKQDLTEKAVKOCTSKVSRISBVLQKLCBIBI 




lamf j?r^tH vt/ ^ &PTODP ^k QnvgEM ^Ti^INIBilAIF~ 
TKVQKRSQXBINDMXAKVASLBBSEGraCpiiKALKBAVKBIQTS 

aksrbwdmkalrstlqtmbsdi ytkvhklvsijoqbqqafkbaad 

TKRLALOAIA’KKLLRSRESVSRLPBKIRJtLBBELRQLKSnSHGP 

kbdggfrhseafbalqqksqgu>srlq^vbdgfvlsmqvasarot 

BSLBSLL3KSQBHBQRIiAALQGRX*BGIiGS3EADQDGLA3TVHSL 
GBTQLVLYGDVBKIiKRSVGHLPSTVBSLQKVQEOVHTIiLSQDQA 
OAARIiPPQDFLDRLSSLDNLKASVSQVBRDLKHIfiTAVDSLVRY 
SVIZBTNKNHliCSAiaJIJiDnLRHDLIIRLPVIVBKIHHXV 



I WBVPV^TVIIJjHIjQQAKQ PXJ? P PSSSXHK8QADHAHHAIGEKR 
AAPOSGKKPBCTPKTKQQKDPMBPQKPVSAYAIjPFRDTOAAIKGQ 
HPHATFGBVSQIVASMWDSIiGEBQKQVYXRKTBAAKKBYIiKAIA 
AYRABbVSKAAAESABAQTIRBVQQriliRSTHUSSUJjHTFIiSQ 
HGTVSASPQHjQQSI»PRSIAPXPLTMRI»PJ(HQIVTSVT iaannp 
smGAPiaSSHGTTMVGSAPSTOVSPSVQTQQHQ^rLQQQQQQQ 
QCXWQQ«0QQQMXJHQHHQQIQQQ>^XX2HPQHHaKXpnXXXX)C^ 
UXXJINQQQUXX>UX5H1<MQUJH>5QHQSQPSPRC^PVASQI 

tspipaigspqpasqohgsoiqsqtqtqvlsqvsip 



6386 819 



IA0GPDAATTDKL3SLGSIJSKA1JGFASRRXDKFGPTVGSQGAB8 
ALKKVPLBVIiRQRBSKWliDKliQfHDKSSMAKKHKKIRI^lCQK^I P 
PSI^GRARQ Yl^GGKVKbQQNPG EQT3KLEWS PGDP KMUJVI KRD 

uirqfpfhkmfvsrgghguqdlfrviucaytlyrpbbgycqaqap 

IAAVIiU^PAKQAFHCLVQXCRKYLPGYYSKKLKAIQLDGEIL 
PSLLQKVS PVAHian>SRQKIDPU*YMTBWPMCAFSRTLPHBSVL 

rvwiwifpcegvkixprvglvllkhaugspbkvkacqgqybtibr 

LRSI*SPKIMQBAFLVQKVVBI*PVTBRQIERBHLIQIiRRWQSTRG 
SM)CRSPPRLHaAKAII»nAKPGPRPAIjQPSPSXRI*PUlAPLPGS 
KARP KPPKQAQKKQRKQtflCGRGQLEKP P APNQAIWVAAAGDACP 
PQHVPPKPSAPKDSAPQPLAPCfVSAHHRSOgSLTSQBSEDTYL 



TVCGS PY 1X3 IMQRASRXKRKlW4WVlJ^Pi^Ilx^^ 
RAQIIiGGANTPYBHGVFXLEVI IPKRYPPHPPQdPLTPIYHPN 
IDSAGRXCUinJEIiPPKQAWRPSIJlIATVIJTSlQUiMSHPHroP 
PIJ4ADI3SfcFH^KPAFIJCHARQWTEKHARQKQBADEBBMIJJNIj 

pkagdsrvhnstokrkasqlvgibkkfhpdv 



PGPTHASADAHADAHAQPNMA«HNKAAPPQI PDTRRELAKLVKR 
RQKIABTXANIfBRQIYAFBGSYLEDTQIfY'Qil XRGWI3RYLXNQK 
RSMSKHDRRNRKFKBAKRLPSKSSVTSAAAVSAIJUSVQDQLrRK 
RBPGSGTESOTSPDPHNQBHBPSQEDPBDUX^VQGVKPQKAAS 
STSSGSHHSSHKKRKNKNRHSPSGMPDYDFBXDLXUiKKPRADY 



pmVpilAft ftmvWADAWAQPMMAMHMKAAPPQI PJPTRRSLAKLVKR 
rQRT.AKTTANTiRgOIYAFEGSYi^PTOWinaflXRGWDRYLtTHDK 
NSNSXHDRRMRXFKBABRLFSKSSVTSAAAVSALAGV^QUEK 
RBPGSGT ES DTSPPFHNQgHEPSQgDPKDLPGSVQGVgPQKAAS 
SXSSGSHHSSHKKRKETKNRHS PSGMPD YD FSTDIiKLNXKPRAD Y 



IXa^GNLCTKBSYDYLKTtWn>VHIVRGBPI)ENI^YPBQKVVTVG 
QPKIGLIHGHUVXPW3»fA3LAIiU25?QroVDILISGHTHKPRAP 
KHENKFYXNPGSATGAYNALETT7X XPSFVUTOIQASTVVTYVYQ 
LIGDDVKVERIEYXKP 



LTIAQRDDGVFVQEVTQNSPAARTGWKBGDQIVGATIYPDHnQ 

SOBVTQUimiGaHTVGLKLHRIDGOaFFPSbGQTODP 



VRWNSKTBCTYI>3IQTQSNFPANliNKLVNCIVXSSLVTTQRKLKA 

HSLLGSSHQIjARAVLITPNpMDFCTKDLLTrTSERIIAYIiRDFNH 

DQKKAIBTAgtfWKHSPSVAKICblHGPPGTGKSKTIVGLLYRL 























393 2017 



394 1418 



6397 | 391 



LTKNQRKQHSDENSHAKIKONRVLVCAP SNAAVDBLMKKI IltEF 
KEKCKD KKNPU^CGDINLVRLGPKKS INS HVLKPS LDSQVNHR 
MKKHLPSHVQftMHKRKBPL0rQLDKLSHQHAI£Ea^BIQRQKL 
DSNISKVSKKRUSlASKIKKVQGRPQKTQStllLKSHIICCTLS 
TSGGUABSAFBGQQJVPFSCVIVDBACQSCKIBTLTPLIHEOf 
ICbIDVCTPKQLPPTVXSMKflQHYGYDQSMM3\HFCRI»LKBNVKHH 
misrlpiixjltvqyrwhpdiclppsnyvymrnlktnrqtkairc 
SSDWPPOPYL V PD VODQS gRHDNDSYINVQgXKLVMKI IKLIKD 
KRRDVSFHNlGIlTHXKAQirnflQKDIiPKBFDRKBPAEVITTVDA 
FQGRQIGDCVIVTCVRANSIQGSIGPIjASLQIIUJVTITRAKYSLP 
I LGHLRTWffiNQHWNQL I QDAQKRGA.I I KTOTKNYRHDA VK3 LK 
LKPVtXJRSLTHPPTIAPBGSRPQGGLPSSKLDSGPAKTSVAftSL 
YHTPSnSKBITL7V7SKDPERPPVHDQDQDPMXKRMGIBVKG0 
X ETjWDPQPSSPQHPGftTP PTGBPGFPWHQDLS HVQQPAAWRA 

lsshkppvrgbppaaspbastoqskcddpbbblchrrbarafse 



GRTGVDlASSHAHRliQXRIdjTVfDVianXIiRUuiPUaBAYJLrXAR 
AHGLBVBPSALBQGFBQAYRAQSHSPPNYaLSHGLTSRQHKLDV 
VLQTFH UAGVQQAQAVAP IAEQLYKDPSHPCTWQVLDGAEITrLR 
BCSTRGL&LAVXSNFDSRLSGUiGGLGLRBHPDPVLTSEAAGHP 
KPDPRXFQEALRLAEMBPWT^fiHVGDNYIjCDYQGPRAVGMHS PL 
WG PQRLDPWRDSVPgKaXXf PSLAHLLPALPCLBGSTPGL 



TGGS KMAAVATOjSVAASTGSAVAT AS KS NVTS FQRRG P RAS VT 
NDSGPRLVSIAGTRPSVRKGOIJjVSTGLPAliDQliLGGGLRVGTV 
LI, I KKDKYW IYSPLL FKYFLAEG I VNUHTIXVASAKHDPANXLQ 

blpapllddkckkbfdbdvxkhktpksnikmkiawryqllpkme 
IGP VSSSRFGHYYDASKBMPQBIjI basnwhgfplpbkisstlkv 

bpcsltpgytkllqfiqzjiiybegfdgsnpqkkqrnxlrigxqn 

IX^PLWGDDICCABNGGHSHSLTKPLYVLROLLRTSIiSACXXTM 
PTHLIQjNKAI IARVTTLSDVVVaLRS PIGSBRBTNPLYKD YHGL 
XBIBQIPRLHHlaXCDESDVXDIiAFICIilCHXIkFTIBRLHCiPPDXiSD 
TVSRSSXMDL2VBSAKKLGPGOGMKAGGKKHLDP 



VLBBEPRWLLHDKVHA 
VLKQLQDILKEASU^gTLPGSGTBGPAKQBHFlLGSOgXDqVKB 
VLTLQGZPVLSQADVHLBKP , BNNC}LIiHPAFRBDEQHKLQQIQI2A& 
NWSOAIYIXTSHIX>SYQPKTOABVlJCIi , QPWMIj< 3 LTRARfniI#T 
TPATLTIiPBIAASGLTI^4PAPAIiPSDLLVNVYIN1i1CCI*CLTVYQ 
LHALQ PNSTKNPRPAGGAVLHS PGAMFEWGSQRIaKVSHVHKVEC 
VIPWUJDALVYPTVSIiQXjGQQLKDKISVFSSY^SYRPP 



PSGRPT3VLOCAARRGAAKHGGSVSGHPAGRTPTBTSNPGSSVM 
KSVTPKDVAVHPIQKKAUJJSARRSLCICYTRMLDQCRTIjASKGTP 
PCKPS CVBQLGQBABPKATERGHiRATGVAVBS qlkpbblpsmq 
DLLKBASSRCMQ ttGPGLFLEMQLVPS IBKRKTPLTREDRPALQB 
PPWSLGCTGLKAAMQIQHWI PVPTLGHRNPWVARDSGB 



ANILSSPSKRGQKBTLIGYSPEGTPI»YNFWGDAFQHSSQS iphf 
IKBSLTOI i£BSDSRQIFTFCtCXJiLLPTFVBLFYGVLTKSLGLI 
SDGFHMLPDCSALVMGLPAAU4SRi«KATaiFSYGYGRIBILSGP 
XNGLFLIVIAFFVFMBSVMlLIDPPHLMHMLTPVSVGGIilVlirL 
IGICAPSHAHSHAHGASQGSCHS3DHSHSHHMHGHSDHGHGHSH 
GSAGGGMNANMRGVFLHVlADTLOBIGVIVSTVLIBQreWFIAI) 
PI/CSIxPIAXliIFLSVVPLIKnACQVIiLLRLPPBYEKKLHIAIiKK 
IQKIEG1*ISYRDPHFWRHSASIVAGXIHIQVTSDV1»EQRXVQ0V 
TGIIiKE&GVNNLTIQVEKEAyFQHMSGI^TCFHDVIiAMTKQMES 
MKYCKDGTYIM 



GAGGVGRFKAlHAPAHMIKWQIDRLGKKVRVXCKTDDTIGDIiK 

KI*n\AQTGTRHNKIVIiKKHYTIFKDHVSIXH)YBIHDGHNIiBLYT 
SEQ ID NO:

Predicted beginning nucleotide location corresponding to first amino acid residue of amino acid sequence

Predicted end nucleotide location corresponding to first amino acid residue of amino acid sequence

Amino acid segment containing signal peptide
(A=Alanine, C=Cysteine, D=Aspartic Acid, E=Glutamic Acid,
F=Phenylalanine, G=Glycine, H=Histidine, I=Isoleucine,
K=Lysine, L=Leucine, M=Methionine, N=Asparagine,
P=Proline, Q=Glutamine, R=Arginine, S=Serine,
T=Threonine, V=Valine, W=Tryptophan, Y=Tyrosine,
X=Unknown, *=Stop Codon, /=possible nucleotide deletion,
\=possible nucleotide insertion)
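The one-letter legend above can be applied mechanically when reading a segment. The following is a minimal sketch, assuming the standard one-letter amino acid code plus the four special symbols named in the legend. The names `LEGEND`, `describe_segment`, and `unrecognized_symbols` are hypothetical and do not come from the source document.

```python
# Hypothetical helper for reading the segment column; names are illustrative only.
LEGEND = {
    "A": "Alanine", "C": "Cysteine", "D": "Aspartic Acid", "E": "Glutamic Acid",
    "F": "Phenylalanine", "G": "Glycine", "H": "Histidine", "I": "Isoleucine",
    "K": "Lysine", "L": "Leucine", "M": "Methionine", "N": "Asparagine",
    "P": "Proline", "Q": "Glutamine", "R": "Arginine", "S": "Serine",
    "T": "Threonine", "V": "Valine", "W": "Tryptophan", "Y": "Tyrosine",
    "X": "Unknown", "*": "Stop Codon",
    "/": "possible nucleotide deletion",
    "\\": "possible nucleotide insertion",
}


def describe_segment(segment):
    """Return (symbol, meaning) pairs; whitespace is skipped, unlisted symbols map to None."""
    return [(ch, LEGEND.get(ch)) for ch in segment if not ch.isspace()]


def unrecognized_symbols(segment):
    """Collect symbols absent from the legend (for example digits or lowercase letters)."""
    return sorted({ch for ch, meaning in describe_segment(segment) if meaning is None})
```

A segment for which `unrecognized_symbols` returns a non-empty list contains characters outside the legend and is best treated as damaged rather than interpreted.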






6398 


■ 


1306 


HKQf«3PLlNRCKKXIiIiPTTVPPATPlRIWLLGGLLPPIaLLLSGIiQ 

rptbgskvaikidpdpapgsfiidqyqgcskqvhekltqgdyptk 

DlKAQKNYFBMBQKAHLAfifljHQGKVLPQIOfTTTHAVAIbFYYIS 

SNVHSDPTBAKASVARTPQQYEaSPHPKYLgYYLT9AIQliDRKD 

SIWKSBTLCYBVHYRTKDVHPOTVYTGATIRPGQPLSTSXilJaSBA 

QBTGHQTLPTlFrCDCIAPVQYFSIiKKEVLlPPYBLFKVIHMSYH 

PRGlWljQmBYSiLSnrYHOQLljKASSKRCIPDPIAIASDSPLTS 

VIIPSKSRV 


6399 


75 


■ 


PNLBTYFGRRC^KD^Oft^PTHTPVCJtKRTVVS KRGVAVSGPTK “ 

RBGMADSt£STPIiPSPBDRIJUaCTSKBU£yYQKKMABCBABH 

EDUiKKU&YKBACEG9QHXLECPI&QRKESIAELQKAI£DMQVC 

LFQSIBHVmLYSBHDRIAIBBUDKKKICWIdi&LVOrDRGBVT 

YFCKKPPHKVTlljQ!CnQAV0ECEQ3B33APKADPKXSKRRPSR 

ERKBSSBBYQRDIQTDUjQVKAIXIAQIXJBQTKLSRBQIBGIjIKD 
RRIHIjEEIQVQHQRHQHICIKELXKISIiHHTQKLLTBSTKDPLQtiR 
SBateWKKKSWHLBiaailJtSlCl KQYRVQCKKKKDKIGKVLPVKHK 
5HHAQSBYIKVMSLOQIBVVYFSGRVBGIPXHLQFVM 


6400 


2520 


1053 


~ K174KCI>BVVyBVQSAIIjRlIHCCjyAMKTGKPPHNIrf1BRKDPETWL» “ 
DNISVTFI^/TDUiKHBTLDIItl SJjSGAVQLRHLaNNbKTIXKR 
DPLELLPLSliS FYUiKKCiDPQTLLTCCLVGKQWNCYISACTEVN 
QTACianiAJQIIJDSVQDALHHKKVYIiKArLRMKQLBDHEAFETS 
SLIGElSAKVYALYYKDGLiCTGSDDLSAKLSfIJVSTGQCVYGIQT 
HTCAAVKFDBQKLVTGSFDNTVACWBWSSGARTQHFRGHTGAVF 
SVDYNDBMILVSGSADFTVKVWALSAGTCUTrLTGHTBWVTKy 
VLQKCKVKSLlASPGDYTLL&ADKYKlKIHPIGRBrNCKCLKTL 
BVSKDRSICUQPRLHPDGKYIVCSSAUILVQVJDEAS YDILRVI K 
TPBIAHLAUX3FGDI PALbFDNRYLY IMDLRTBSLI SRWPLPE Y 
RKSKRGS8RTJV3KASWLNGLDGSNDTGLVFATSMPDH3 IHLVLW 
KBHQ 


■ 


109 


766 


PGAAWSRPDItRGCXrTGPQPALRHLVl. PS PCPQP-LAJ? S S VBTkfiG ' 
PPHRTCRSPBPGPSSSIGSPQASSPPHPNHTLLIErrQGVPYTVl, 
VDEES0AEPGA9GAPGQXKCYSCPVCSRVPEYMSYLQRHSITH3 
EVTPFBCD I OGKAPKRAGHLARHHS OTACSGGRPHOCPIjCPRRP 
RDAGBLAQBSRVHSGBRPPQCPHCPRRPMEQIITLCIKBTHHKHP 


6402 




279 


TTSQCXAJIRQSSAIFTOSMKPAAICUMAIJiIAPKSOTPKOBH 
GAKNSHQDGCWrBSSBSS BTCS S KSHDGDKPI PAP^SS PLRKQB 
LBNLKCSILA£SAYVAIiALGIS 1U1ALNHADELI^X)PKLSGSLKP 
LGHLYAAKALISLDftlSDAITHLNPKNVTDVSliGISSNBQDQGS 
DIGHNERrtES S GKRAPQCYPSSVNS ARTVHLFNLGSA Y CLRS EY 
DKAElKCIjHQAASKIHPKEVPPBAILLAVYLBbQHC2frQliAl<QlI 
KRHQLLPAVKTHSBVRKKPVFQPVHPIQPIQMPAPTTVQRK 


6403 


5 


1690 


RGIHI^VI>QGNL[SWHYSBNVVII4HI2WI<7LTQVQQKHLI*IT?IX3 
RSVDOTSQAlQSlKHDFQHIjCX3VFLQAKHirrDWIiKBKVQSLQTL 
AAHHSALAKAHHDTLBDMHSQIJ^PTGC^ffiHrrTISOANEQNIiK 
DliQDLHKDAKNSTAIKPKOLEERFQLFBTDIvm ISNISYTAHH 
ItRTLTSNLHKVlffl’CTDTbTKHTDDLTSIJWTLAHIRLDSVSIJt 
KQQDI24RSRLDYBVAHLS YXKBBMKLVDSKBGQDIKKPT ILQGP 

pgpbgprgdrgsqgppgptgnkgqkgbrqbfgppgpagbrgpig 

PAGPPGBRGGKaSRSSQGPKaSRGSPGKPGPGGPSODPGPPGPP 

gkeglpgpqgppgfqglqgtvghpgvpgprglpglpgvpgmpgp 
kgppgppgpsgawpialqnkptpapednscpphwknftdkcyy 
FSVBKBIPBDAKLFCBDKSSHLVPIMTBEBQQHIKXQHVGRESR 
WIGLTDSEREHKWKWLDGTSPDYKNWKAGQPWfWGHGHGPGBDC 
AGJjI YAGQMNDFQCKDVHHP I CBKDRBTVLSSAL 


6404 


1012 


222 








R PAl/3LSGGSLVSMI*ARKLPAAVAPAGPA5LARWTXiG FCDKRLV 
PPDHAESTYGLYRTHLLSRLPIPBSQVITIKPELPVEEAAKDYA 
KKLRQAFQGDS I PVPDLLILGVGPIX3HTCSLPPDHPLLQBREXI 
VAPISDS PKP PPQRVTLTLPVLHAARTVXPVA2XIBGKAAVLKRI 
LEDQRBHPLPAALVQPgrgKLCWFlDHAAARIil/FVPPBKHSPI, 


6405 


1 


145? 


AALPRPTPRAPLGREGTGSDSSMAASffFYGRLVAVAXI^lNHRPR 

TAQRAAAQVLGSSGLFNNHtn#QVQQQQQR2fLSIiHHYMSMKLLQB 

agvsvpkgtvakspdbayaiakklgskdvvikaoviaggrgkgt 

FBSGLKGGVKIVFSPEEAKAV5SQH1CKKLFTK0TBEKGRICS0 

vlvcerkyprrbyyfaitmkrspqgpvligsshggvn ikuvaab 

TPRAI IKKP3DIEBGIKKEQALQLAQKMGFPPNIVBSAAENHVK 
LYSLFLKYBATMIBIKPMVEDSDGAVLCMnAKINFDSNSAYRQK 
KI PDI^DWTQEMRDKDAAKAlfliNYIGLDGHIGCLVNGAGLAHA 
tkdiiklhggtpanfldvgggatvrqvtbafklitsdkkviail 

VFTIFX^IMRCDVXAQGIVHAVKDIiKIKIPVVVRIiQGTRVDnAKA 

LIADSGLKJIJVCTOLDBAARHVVKLSBrvrilJUQQAHVDVKPQLP 

I 


6406 


1036 

" 


167 


yPRQMRGKDTi> P YSSGR YDS I KTBVSGCPBDLTVGRAPTAD 

DDDDDHBETHElJNEKMHDSBGMDPBRLKAFNMFVRLFVDBNIjDHM 

vpiskqpkbkioaiibscsrqppbpqbrarkrirtylkscrrmk 

KNGMBHTRPTPPHI/rSAMAEGXIkAAACBSETRKAAKRMRIiBIYO 

SSQD5PIALDKQHSRDSAAITBSTY5LPASSYSQDPVYANGGUJ 

YSYRGYGALSSNLQPpASLQTCaNHSNGBSGEARAIiASRPAPSWV 

craalgsgnqrgkqrpvmkrgclta 


6407 


492 


150 


VGLCIAVSQTVLAQLIVUAWPGQVAQI^CT^PQHVTIRDYGV 

SWYQQRAGSAPRYLLYYRSKBDHHRPADIPDRFSAAKDEAHNAC 

VLTISPVQPRDDADYYC3VGYGF3P 


6408 


1458 


903 


^^CITSSQAWRLFt^VTRGFNMRIBKCYPCSGPIYPGHGMMPVR 
NDCKVFRFCK5 KCHHNPKKKRKPRJCVRWTKAFRKAAGKE LTVUN 
SFBPBKHRNKPIKYQHKLHNXTinAMKRVEBIKQKROAKFIMNR 
LKKNKKU)KVQDIKBVKQWIHLIRAPIJU3KBKQLRBKMVQQLQB 
DVDMKDAP 


6409


iSo 


446 


NTALANLi, RCPTCDRLOBGCTAPAPPAHQGIVIiQ PVMPS CDPGP 
GPACLPTKTFRS YLPRCHRTYSCVHOAHLAKHDKLXSKS FQGS 
HGRAYLFNSV 


6410 


85 


607 


RGGTMK^CI^C3fG0SS3PKAAPPAGSACLPADSCPCLLF0AC ' 
AlSGLPNCITIHPLNIAAjGWMIHKAFIIiLIfCBAPFCCOPXBPA 
NTVAEKVDRLRSKQKAVFYCGMAVVP XVT SLTLTTLLGNAIAPA 
TGVLYGLSALG KXGDAI S YARIOQQRQQADEHiaABTLKaKL 


6411


302 


772 


RLS 1MASSU4EDP8GSR1TYVKGDLFACPKTDSLAHCXSEDCRM 
GAGIAVLFKKKFGGVQKLLHQQKKSGBVAVLKRDGHYXYYLITK 
KRASHKPTYKiniQKSIiBAMKSHCLKKGVTDLSNPRIGCX5IJ)RLQ 
HENVSAMIEKVraATOIXITVYTL 


6412 


61 


1709 

1 


RPVTSFSPLPGSCGGRLGTimiLGHSLREVSAALKQGQITPTEL - 

CQKCLSLXKKTKPLHAYITVSgE?VALKQABBSHKRYKNGQSLta> 

UXSIPIAVKDNFSTSQJBTTCASHMLRGYTPPYHA'WVQKLLDO 

GALIJIGKTNIJDHFAMGSGSTDGVFGPVKNPWSYSKXJYREKRKQN 

PHSKNBDSDMLITGGSSGGSAAAVSAFTCXTVUiGSDTGGSTRNP 

AAHCGLVGPKPS YGLVERHGLIPLVNSKH3VPGI LTRCVDDAATV 

LGALAGPDPRDSTTVHBPXHlCPFMIiPS LADVSKLCIGIPKKYLV 

pelssrvosuiskaadlfbsbgakvievbijphtsysivcyhvlc 

TSKVASNMARPDGLQYGHRCDIDVSTRAHYAATRREGFNDWRG 
RILSGasJFPliLKENYENYPVKBQKVRRIiIANDFVHAPIJSGVDVIJ, 
TPTTLS EAVPYLKP I KSDNRTRS AQDDI PTQAVNMAGLPAVS IP 
V7VLSN0GLPIGLQFIGRAFCDCQLLTVAKWPBKQVOPPV1 QLQE 
MDTK^VLKNBKLASVSLKQ 





6413 


2 


085 


HBPRCAGMAASiWPK3DI*EPYMDBNFlSHAFATM0ETVMSVKIIR 
RRLTG 1 PA&YCFVEFADIJCTABKCUUCDSGKPLPGATP AKRFKL 
NYATfGEQPDt9SPBT9LFVGDI»TPDVDDGKLYB7FVKVYPSCRG 
GKVVLIXJTGVSB3TTOFVKPTDKLEQKRALTECQGAVULQSKPVR 
LSVAI PKASRVKPVBYfKJMYSYSYNQYYQQYO^fYYAQWOYDQNT 
gsysysvpoycjytqstmqtykkvgddaledphpqldvtkankep 

WEQSBKLTDAUlDCHIfQPIiDTVSSBXPAKM 


6414 


1 


538 


RGGRAAIiLPNRRF pccrprpqparpssratpgprspgmats igv 
S FSVCTGVPSABKHAGBPENTY IIJIPVFQQRFRPSVVKDCIHAV 
IdCBKLAHAHYSPBBKPQLTKHbSKNIKDKLKEMGPDRYKMVVQV 
VIGEQHGEGVPMASRCPWIIMlTDIJYTHIIVPMNDStirCVVAAFGC 
PTY 


1 


2 


1168 


PVKQWQSSHRJUVCGIjGCEARAGGGEBPRGRASSVAGWVGAPRAP 
FZ KAAVAGLOVG5G£RRRGHKMPVHSRC3)KCSTNHHl>BKEVmCA 
BHBGSSSKDKDTKSSSVBEDODSSHMnDEDCERRRKBCLDBMSN 
LSKQFTDIin)QLyKSRLSQVEAKLQBVlAOmPSyLBP!LA.TICB 
HMQIRTKV&OlYRELCXRSVKNKTKCEIQASRQHCBSHKtiLLyD 
TVQSSLBHKXRRLKBDRHSXDITSKLWKDBbQSRKKRKDPPWPD 
KKKPGVVBGPYIVYHI^Il)ILELmrriRKAMATLGPnRVKTBP 
PVIGjKKHIiHSARSEBGRIjYYDGBHYIRGQTICIDKKDBCFTSAV 
ITTXNHDBVWFKRPDGSKSKLYISQLQKDKySIKHS 




410 


1519 

| 


eiapadi^ipacapvi^ratsstmgvtggkmapsltqbxlshi, 

GLAS KTTUVWGTLGTLRTPTA^S VDKD^QltLLRAITGQGVERSAI 
VDVLTNRSRBQRQLI SRNFQERTQQDLMKS LQAALSGNIiERTVM 
ALLQPTAQFDAQBLRTALKASDSAVDVAIB XLATRTPPQLQECL 
AVYKHNFQVRAVDGITSBTSGIIODIiLIALAKGGRD3YSGI idy 
NIAEQDVQALQRAEGP 3 RBEXWVPV7TQHNPEHLIRVFDQYQRS 
TGQKLKEAVQHXFHGDAQVALLGLASVXJQTrPLYFADKIiHQAIiQ 
BTBPNYOVLIRUjlSRCETDIJiSIRABPRKKFGKSLYSSIiQDAV 
KGTKX)SAWJVLCRABEW 


6417 


1 


845 


RGKSRVLWSBI^GKAGGAX5Gfr^SUWRrtDNRF 

LISTIYMAASIGTDFWYEYRSPVQENSSDLNKSIWDEPISDSAD 

EKTYffliftLPRYNGTVGLWRRCITIPKNMHWYSPPERTBSFDVVT 

KCVBPTLTBQPMBKFVDPGNHKSOIDLLRTYLHR0QFLLPFVSL 

GLMCPGALIGbCACIC&SljYPTlATGITiHULAGIfCTLGSVSCTV ' 

AG IELUZQKX*EIiPDNVSGEFGMS FQjACVSAPLQFMASALFIVIA 

AHraRKErnJflCAYRVA 


6418 


2 


662 


trtrprrppglgaavgkagarststpagaspaaayqadppp pah 
TPAPPPPPPOMIACKOBPAKPYGYDNU^QPIPTTQQBABLVQ 
YPDCKSSSGNIGEDPDHIKQSSaPSWFPfMRPOAAPGRRRGRQ 
TYSRFQTLBLBKBFLFHPYLTRKRRIHVSHALALTKRQVKINFQ 
HRRMKH KKKNNKDKFPV3RQ5VKDGETKKEA0KLKKDRAEGLTO 


6419 


1 


973 


PGRPRVRNFDIJJ5K3ILQBPPCXRSIQI PAHRSKTAMSKCPIFP - 
KARS ISTSG PLDKB37rGRQKLlSTGSI<PATLQGATDSlXn«BHHL 
PSPDPVTVFYLS PLVVWKELESIXHNEGDHAITVADPVDHHPrv 
PHNItVHYFRKLDLPSNLPGL ILSSBHQnCYSKIPSHCHSBIXSKY 
VLIQKIWDNMKLRQDPGQPLY IlAiNAHrQKYPMVHLLQKSEWSP 
NggUJC3WVK3 1 KMNPVYGPMSQrLETLNKCPHFKRQRSLYRgI 
LPI*SLV7iLCTKHXDIDAFDKHYKMAYI>RLTPSqVKSTHNCL>RPP 
STGVHECRKTFGBPTI# 


6420 


207 


1187 


RKMlDKNQTOJVGQDSVPYMICLIHII^KWFGVEQLEDYWfFAM 
YXiAfVFTPI^LLXbPYFTI PLLYLTI IPLHI YKRKHVLKEAYSH 
KLWDGARKTVATLWDGnAAVWHGYBVHaMBKl PBDGP ALI IPYH 
GAIPIDFYYFMAKIPIHKGRTCRVVADHFVFKIPGFSIiUiDVPC 
ALHGPREKCVEILRSGHLLAISPGGVRBALJSDBTSfHlVKGHRR 
GPAQVAIDARVPI I PMFTQSf IREGFRSDGGTRLFRtCLYBKFRYP 



6425 I 1850



FAPMYGGFPVSXHTybGDPIPYOPQITASKLABKTKMAVQAiaD 
KHQRIPGNIHSALLBRFH 



WAIjSItRRQPEHMSKKLLSPHPHSWLRSBFKMASSPAVIiRASHIj 
TQWSLKSSAQPLGS PQIJIQVGQIIRVPAHHAATLI1BPAGRCCH 
DKPVRIAVRGLAPEQPVTLRASLRDKKGALFQAHAHYIlAErn/jK 
UHiKRAPAliGOSFAGLBPMGLIi5Rdi8PEKPI*VRLVKRDVRTPl»A 
VBIiBVlJ)CTDPDPGRIiIiCQTHHERYFI*PPGVRRBPVUVERVKCT 
LFIiPPBPGPPPGIVDHFGTGGGLLKYUASLLAGKGFAVMRLAYY 
NYEDLPXTKBTUO^TFSBAKETTLLSHPBVKGPGVGLIjGISBBG 
KLd>SMASFT*BGG I XAAVVJNGSVI\NVGGTLRYKGBTI»P PVGVNR 
HRIKVTKDGYADIVDVIiKBPLBGPDQKSFI PVBRAKSTFLFLVG 
QDDHHPnCSBPTDVNBACHKLOAHGRRKPQI ICVPBTCHYTKPFYF 
FLCRASLHUiV 63P1 IHGGBPRAHAMAQVDANKQLQTPFHKHliG 
GRBGTIPSKV 



EGKKI^WroEFWCDIAKEFYWKTPCPGPFLaYNFDVTlCGKXFIE 
WMKGATOtt CYNVl*DfiHVHBKKLGDSCVAFYWBQIEPGSTTQIT!r 
hqi^vqvoqpsnvijixqgihkgdrvaiyhphi pblwahlacar 

IGALHSIVFAGFSSBSIXERIIiDSSCfiliLITTDAFYHGBKLVNL 
KBLADBftLQKOQSKPP PVRCC3 WKHDOTAKLGWGDSTSQSP PX 
KRSCPDVQI SWHQGlDLHHHKLMQEftGDBCEPBWCDAKDPIiFIL 
YTSGSTGKPXGV VH T V GGlMLYVATTPltYVPPFHAHDVFWCTAD 
IGWITGHSYVTYGPLflBQATSVLPBGIPTYPPVNRUJSrVDKyK 
VTKFYTAFTAIRIJJ4KFUDBPVTKHSRASLQVIiGTVGKPINPEA 
HLWYHRVVGAQRCP IVDTF WQ TETGGHMLTPLPGATPMKPGSAT 
PPPFGVAPAILNESGBSIiEGBABGYI'VPKQPHPGIMRTVYGNHB 
RFBTTYPKKPPGYYVTGDGCQRDQDG YYW ITGRIDDMLNVSCHIL 
LKTAKVESALVKHEAVAEAAWOHPHPVKGKCLYCFVTbCDGMT 
P3PICLTBBLKKQIRBKIGPIATPDYI®iAPGLPKTJlSGlCIMRRV 
LRKXAQNDHDIiGDMSTVADPSVISHL*PSHRCX»T IQ 



ANLJCBIPHDI*PPBTVIiL YLtDSNQITa XPNEI FKDI*HQI»EVIiNX*S 
KKGIBPIDBH&PKGVABTLQTlALSnNRIQSVHKKAFNHliXARA 
RljtfaNPHHCDCTLQGFVLHSMASimBTAHSVICXTSVLDEHAC^ 
FLN7U^Nl>ADIjCNLPKKTn)YAMLVTMPCWFTMVI3VVVYYVRQN 
QKnAHRB^BYLKBLPSRQKKADKPDD ISTVV 



(SCVXiHUCTCNPIDKIJRLFTRGGSGGMGIPRIii 
GGKGGKBGiyvniVVAHIlRMTIiKatJaJRYpRKRFVMSVGMISICISA 

lkgskekewhipvfvgisvtdbkgkx igklnkkhdrilvaqggl 

GGKLLTNFXiPLKGQKRIIHUDLKLXADVGLVGFPNAGK&SLLSC 
VSHAKPAIADmFTTLKPBLGIOMYSDFKQISVADLPGiaBGAH 
HHKQtGaKP LKHl BRTRQLIJVVDISGFQLSSBTQYRIAPBYlI 
LLTKBLBLYKEBLQTXPiUiLAVtlKMDltPDAQDKFBSIiHSQLQ^P 
KDFLHLFBKMNIPBRTVBP^IPISAVTGSGIBEUCNCriRKSIj 
DBQARQBNDAIiHKKQLLHLsriSDTMSSTBPPSKHAVTTSKKDI I 



LAMBGGGGXPLB7IjKBBS0SRHVIJ>ASFBVnSLQKSHWGFLLTB 
LVGGTLVAVYAVATPFVTPAIJIKVCLPPVPA'IW KQ IBNWKMLR 
CHHC^LVDIGSGDGHIVIAAAKKGFTAVGySIiNPHLVWySHYRA 
ffiU3GVHGSAK7YISD£MKVTFSQYSHVVIFGVPQM>lLQU3KKLB 
RBLEDDAHVlACRFPPPHIfrPDHVTGEGIITTVWAYnASTPRGRE 
KRPCT3MHFQLPIQA 



SRGAAVGQISVAGGBXRGDTGGKDTAAPGRFSFSPKPTIjEDIPR 
LHABFAABRDWBQFHOPRNItliLALVOBVGBIABLPQWKTDGBPG 
PQGWSPRERAALQBKI^SDVLIYLVALAARCRVDLPLAVLSKMDI 
NHW^YPAHUiaSSSRKYTHLPHGaUSEDQAVGPADl PCDSTOQT 
ST 



AASWOPPHVPKAGKMVSWMI CRLVVLVFtMLCTAYASYKAVKTK 
NIRBYVRHMMywrVTALFKAAEIVTDIFISWFPFYYBXKMAFVL 
HLI*SPYTKGASLLYHKFVHPSI»SJRHBKBXDAyiVXIAKBaS^BTV
LSFGKHGU?1AASAAVQAAIK5 QGAXjAGRLRS7SMQDXiRS£SDA 
PAPAYHDPLYIiEDQVSHRRPPIGTBLAGGbQDSDTBDRCHSDTEA 
VPHAPARPRBKPLXRSQSIjRVVKRKPPVRBGTSRSLKVHTRKKT 
VPSDVDS 



S6SGGKMBDHQHVP IDIQTS KLLDWI*VDRRHCSLKWQSLVLTIR 
EKIKAAI QDMPKSBBIAQIiLSGSTI HYFHCLR ILDIiLKITrEAST 
KKI PGRYSSOHMKDWQE I IALYBK35}TrYLVHI>SSI^VRNVNYBI 

pslekqiakgqqlqqbysrkhhbcoagaaemreqfybsckqygi 

TGSHVKGKLLALVKniiPSQLAE IGAAAQQS1GKAIDVYQASVGP 
VCBS PTEQfVIiPMLRFVQKWSISTVYBMRTGTBPSWHRPHLEKL 
PEQVAKDAIDWGDFGVKAVSEC3TDSGISAEAAGIDWGIPPESD3 
KDPGGDG H3W0DDAVALQ ITVLSAGTQAPEOVARGPDALTIiLBY 
TBTHNQPLDBLMELKI FLACRAVELSEBADVDSVSQFQIiAPAI L 
QGQTKKKMVTT4VSVIiEDIiIGKLTSLQL0HLFWl LAS PR YVDRVT 
Bpr^X>irT f Tqy : Q T ‘ r ' ■nT.tnrer, M VQKQQKAI ^BQAALgP KLDLLLBK 
TKKLQKIaIKADISKUYBGBPVNLMOTSI* 



BPSSWTAAPRGPLAAHPLKAAVQSDDaRALSFDSRIKVFANGTL 
VVKSVTDKDWSDYIjCVAHHKVCTDYVVLKVDVVHKPAKIBHKBB 
HDHKVFYGGDLKVDCVATGIiPNPEISHSLPDaSLVNSFMQSDDS 
GGHTKRYVVFNHGTLYPNEVl&IRHEGDyTCPAENQVGKDEMRVR 
VKVVTAPATI RHK TCIAVOVPyGDVVTVlVCBAICGKPMPKVTWLS 
PTNKVI PISSBKYOI YQDGTLLIQKAaRSDSGNYTCLVRNSAGB 

drktvwihvnvqppkingnpkpittvrhiaaggsrklidckaeg 

IPTPRVLVlAPPEGVVLPAPYYGNRITVHOJGSLDIRSIiRXSIiSV 
QLVOiARNBG&RARLtVQLTVLEPMEKPIPHDPl SBKXTAKAGH 
TISLNCSAAGTPTPSLVWVLPNGTDLQSGQQIiQRFYHKADGMLH 
I SGI>SSVI1A(MYRCVARNAAGHTERLV3 IiKV0LKPKANKQYHNXj 
VSIINQETLKLPCTPP aZlGQGRPSWTLPNQiHLEGPQTIiGRVSL 
LOIGTLTVREASVP D K U T YVCRM g’tKY D PSVTSI PVTVXAYPPR 
IT S gPTP V XYTRPGMTViqjgCMftHBI PKADirWBLPDKSHUCftG 
VQARLYGHRFLHPQGSLTI QHATQRIlAGFYKCXftKNI LGSDSKT 
TYIHVF 



KTRVSTGliRRTIfLWSEAVGASSTftGDTGI PGSGBGOAGPGGGBG 
AMLEAHAEPSPBDPPPTLKPETQPPEKRRRTIKDFNKPC3FVLA 
YAGYIPPSKKESUVPASGSSSPIiRQBSAADSDOWDSAPSDLHTI 

qtpvkkaksskhraacmptqpgpprstfsrixiapdsatllbkm 

KLKDSLPDUDGPKVASPLSPTSLTHTSRPPAALTPVPLSQGDLS 

HPPRKKDRKNRKLGPGAGAGPGVUflRPRPTPGDGKB^RSRIKKSK 






TWGGBAPVPVLPT PPSAPRPPATVHPBGVPPADSBSKBVGSTB 
TSQDGDASSSBGBMRVMDEDIHVBSGDDSWDblTCYCRKPPAGR 
PMIECSLCXnWXHLSCAKIKKTOVPDFFYOQKCKBIiRP BARRING 
OPPKSGBP 



WNN SS YNL PAY7U?YI*PCBACAPIQDGHXGGAY7VGKMKATTAGVGa 
I£EEALRRXBRLKALREKT6RBPKEDGEPICrKRtRBBERE6BKR 
EEljRLRNY^rpEDBDLK3CRHVPQAEtPVAVHBKVKEQIi8AAKPBPV 
IEEVDLAin^APRKPI)TOIJ{RDVAKKLEKLKKRTQRAIAEI»lRER 
LSGQEDSLASAVDAATEQKTCDSD 



GGI/nTfGSRIKQNPETTFEVYVEVAYPRTGGTLSDPEVQRQFPE 

DYSDQEVLQTLTKFCFPFYVDSLWSQVQQNFTTVLTDIDSKQR 

pgfcrlssgakscfcilsylphpevfykllniladyttkrqenq 

WJBLLBTLHKLPIPrPGVSVHLSVHSYPTVPDTRELPSI PKNRN 
LTHYPVAVDVNNMLHLYASMLYKRRI LI ICSKLSTLTACIHGSA 
AMLYPNYWOHVYIPVLPPaLLDYCCAPHPYLIGIHLSIJiHKVEN 
PftLDDVVILIAnDTJrnjfiTPPDiniQSLPNDVI3SLKNRLKKV3TT 




TGIXJV»BAFIilCAQAAFFGS5fRHALKIBPBBPITfCSBAFVSHyR
SGATWQPIXBSRTQLOLFRQFIDGRliDIJJiSGKGPSDVFKKKINM 
GBTJVGSDKLYHQWLSTVKKBSGfll LNTVKTKAHPAMKTVYKFDI 
AHHGCRPTPEEQLPKTAPS PLVRAKD PKLRKD RRP ITVHFGQVR 
PPRPHWKRPKSHIAVEGRRTSVPSPEQNTIATPATLHILQKSI 
THFAAKFPTEGWTSSSH 



APVTKRKBVFAKDSKGSALDAGRDPKRPALPBTLCBSGWASNTA 

PTTPPQPGWCLGGKII^KSSCQTPGRBKKRRLAIHHQSCSPLMLL 

IJIJJJiLYRTTGPVGAIjTDBBKRUWHLHIILYHAQWSPTASDML 

»HlW>BEL#tflFA3MARQ(VWGHNKBHGHfiGBHLF?^lTDB(3<DV 

PIj\MKBWHHHRKHYNLSAATCSPGQMCGHV'rQVVHAKTKkIGCG 

SHFCBKLQGVBETNIELLVCNYBPPGNVKnKRPYQBOTPCSQCP 

SGYHCKHBLCBPIGSPKI1AQDLPYLVTBAP3PRATEASDSRKM3 

AEGPDKPSWSGLMSGPQHVWGPLIiQLIiIiLPPLVLBGIF 



SPKLRQKSPIiFQPAKISSSTSHSIlASTKDCUTSALFQPAKISSN 

TSQUX^BPVKRCGKSALFQLAEHCLASBOIHHEBSKLISAXBS 

DGGRI KHIJ8KGKEKKBIKMB KTDKTRIjQKBABPBlCfiAKKNLRI® 

KELBHFEALQIPDIMftlKMgPPKBIRKBtfltgRPHKCSHPPDPSY 

SASSKiriSDVPSRICffiJMCaPHOimiKDPAALNXPBKLKKKKK 

K9KMDRHC2?DKSTPKKTCKKRQS3BSDIBSVXYTIBAVRKniJHG 

IBKIiGDTPRKKVRTSSSGKOS ILDAKPPKKKVKSRBKKMSKBKS 

SOTTKBSaPPDPISISASKBrSGETPBGIKRBPLTPMEDAUPS 

LSGQAKPKDSDCHRKlETCGSRKSBRSCKGALyKTLVSEGMLTS . 

LRftHVDRGKRSSGKraiSSDHEGCKNHKSWrFSQSGTSGSKKPKK 

TXPKRDC1iLGSAK1iDEBPKKKFNS1»PQ Y S PVT PDRKCVPVPRKK 

KKTGHVSSRPTKTSKGSGDKWSNKQLPLDAIHPTKMPSKDRNT 

HEPVHKVKNI PSI PNTPEPTTTARTFGGQPKEKSKEHPDYS PCQ 

DTQRAGYHHBBVLWHTNIMINCGGVYLEQliRHEANTNA. 



ALQREAAAAYAHPEY^RFLQBBTVSQ QIKS 

VVKSQRPLQRQVHIiRGRPASOPTVIROITYYKAKVSKBBMJIRE 
QQDBFFSGDNGVDLldXDOLIiKUNGLMTSVTRRPAATSQGHSTJl 
VTSDLMARTAPNSSALPQPSTSDPSIANHASVGPTI^JTTSVSPD 
PTRBSVLQPS PQVPATTVWTEATQQPAAPAPPAVS PRBALMBAM 
HWPVPPTTVRTOSI/5KDAPAGROTTPAS PTLspbBBDDXkti VX 
(aiCKirELSTITGPTTQSTYGRHEGAWMKDPlAKnElirYVTirYYT 
caJTLVBPEHLBNFKQQRWSNSYKlApYSWlGTCSWYlfGAFYYHR 
AFTTO»I IKTDIiKQRYVAAHAMiaDVAYBEATPWRKQGHSrJVDFA 
VDKMGLHLXYPALDDEG FSQKVXVLSKLHAADUSTQKETTWRTG 
LHMTPYfflfCFVICGVLYAVDSYHQHHAHI STRPDTHTHTQiypR 
IiPBHHYPYTTQIDYHPXDRLLYAWCNGHQVTYHVIEAT 



gacj^Pvrqppd^pdyeai^agaivtthwvagavag xlbhcvm 

YPIDCVKTRMQSLQPDPARRYRNVLKALHRI IRTBGLJIRPMRGI* 
NVTATGAGPAHALYPACYBKLKKTLS DVTHPGCRCSHlAHGAfiGC 
VATIXHDAaMHPAEVVKQRMQMYBS PYHRVTTDCVRRVWQHECTG 
AFYR3YTTQI*TKHVPF0AlHFMTTBPXvQHHFNPQRfiYWPSSHVL 
SGM3 VGAVAAAMTPLDVCKTIiI*BTQKSIJUjI 33HIT®IITG>4AS 
AFTtTVYQVGGVTAYFEGVQARVlYQIPSTAIAHSVYHFPKYLJT 
KEQBBWRftGK 



P PAPAPPASPARHVTKIARQHLKGGSHAPPIiIiaAVFIjQI KNMVK 
LIHTLADHGDIJVNOCAPSFSIiLATCSIiDKTIRLYSIOTPTKLPH 
SPLKFHTXAVHCCCFSPSCHIIiASCSTDGTrVIiRHTBNGQHIjAV 
MHQPSGSPVRVCQFSPDSTClASGAAIXxrwVIiNHAQSVICLYRaS 
SVKDGSIAftCAFS PNGSFFVTGSSCGDLTVWDDKMRCUESBKAH 
DLGITOCDFSSQPVSDGEQGLQFPRLASCGODOQfVKIHIVBFIH 
ILOFELKYlCSTLSGHCAPVIJVCnPSHDGQMLVSGSVDKSVTVYD 
TJTTBNIDHTLTQHTRYVTTCAFAPNTLIiIATGSMDKTVNIKQFD 
LBTLCQARSTKKQLKQFTEDNSEKDVSTWLCAQDLKDLVGI FKM 
KM 1 DGKKLLHLTKBSIAMlIxKIBSLGLRSKVIiRXIKKLRTKVKS 
LSSG1 PDKFICPITRELMKDPVlASDGYSyRKEAMBHWDPAKRN 
HTSPP 

BVQILRAHHEXlIXKlIjlVPYGLIA(yrHAQl^lxPVPU3QTLPLNV “ 

KPALPLSPTGLAGSLlliALSNGLZtSGGIjLGXLEHLPLLDILKPG 

GGTSGGLLGGXiLGKVTS VI PGLNN I IDI KVTDPQLLHLGLVQSP 

DGHRLWTI PI£I KLQVBTPl#VGASLIjRLAVKLi>IXABlLAVRD 

KQBRIHLVLGDCTHSPGSMJISIJJX^GPI^IQGIIIJSLTGILN 

KVLPKLVQtSJVCPLVHBVLRGLDlTLVHDIVNMLIBaLQPVIKV 

SIQTAEAITTEMASQSCK1IQQLLQABKRAABHVADARKR5ARRL 

KOAKBBAQMKVBQYREEREHEPQSKQQAAMGSQQILSABVBQAT 

RRQVQGIIOSSQQRNRKRVLAQIJXawaWRPaVHPKYRISA 

rarhhsdmgdlpglvrjls iauiiq vndgwpVkvdgqrfgqnrt ' 

IKIiLTGSSYKVBVKZKPOTLQfVENISIGGVLVPLELKSKBPDGD 

RWVYTGTXDTEGVTPTKCGKRQPIQITMPFTDIGTFETVWQVKP 

YNTfflKRDHCQWGSPPSVIBYECKPHBTRSUWVNKKSKL 

KSGOLRRRQRPGRSAAVGBKRLt PGMBXFKAAMIJLG6Vta>AI/?Y ~ 

RKVCKBNSTVGHKIOSBLQRSGGIJDHLVLSPGEirPVBDNTIMHI 

ATAKALTTDyifCLODLXRBMVRCYVKlVKKLPKRJlPDPATIEGC 

AOLKPNNYIiLASfHTPFNBKGSGPGAATKAMCIGLSyWKPBHLBT 

LIKVSV2CGRWrHNHPT5PLG3I^rrALPVSFAA0GKPLVQ5KmD 

mlhavplareychktirhtaeyobhwpyfeakwqpylbkrkxsk 

dshkkaifpdhypabbrbktyrkwssbgrggrrghdapmiayda 

LIAAGNSNTBLCHRAMFHGGBSAATGTIAGCLFGIiLYGLDLVPK 

GLYQDLEDKBKLEDIjGAALYRLSTHEK 

ABDPAGGIiAGQDTMPARGLKRKCVGHBBDVEGALAGLKTVBSyS 

LQRQSLLK'ISLVKLQLQIIWjVBPNLCRSVLIANTVRQIQBBMTO 

DGTTfRTVAPQAAKRAPIjDBXVSTKILCRAABGQKSVHPASGIiGD 

(MTQGPVSDix^VTSAOAPHHLOSSAWKKDGPRBHRGfiFHKSLD 

QIFBTLBTKNPSCMEEIiPSIJVD2PYYDLDTVLTGMW5GARPGPC 

BGLBG LAP ATPGPSS S CKSDI/3ELDHVVBILVBT 

MASPAASSVRPPRPKXKPQTLVlPKNAAKSQKLKLKRI>ffiNPDK 

AVPIPBKMSBHAPPPPPBFVRDVMGSSMMSGEFHVYRHLRRR 

KYURODYMDAMAKKQKLQABFQXRLXKITKIAARBQTAICRRHia&Q 

iclkbkkliakkmklkqkkqkgpggpkkogssssakasgtebkbb 

VP2PTMGR 

G6TPRGXMRAPIPBPKPGDLIBIFRPFyRHWAIYVG0GyWHLA 

PPSBVAGAQAASVMSALTDKAIVKKBlXYDfVAGSDKyQVNNKHD 

DKYSPLPC3KIIQRAKKLVGQKVLYKLTSKKCKHFVNBLRYUVA 

RSDGVRTVI IAASVAGMGLAAMSLIGVMFSRNKRQKQ 

AfiAfiGAACAARSPRPQAHTKDVRGLP$aRJ^PDCGRrtBLAAGSP“ 

SBBQFtfBACABLQQPAIAGADWQLLVBTSGISIYRliLDKECrGLY 

BYKVFGVLBDCSPTLIADIYND3DYRKOHBOYVKELYEQBCHOB 

TVVYWBVKYPPPMSNHDYVYLRQRimLDHBGRKIHVILARSTSM 

pqlqbrsgvirvkqykqsiaiesdgkkgskvfmyyfdhpggqi p 

SKLINWAAKKGVPNFIJaJMARACQNYIiKXT 

RCPTRSP P PDTPGSRGTTAMCSLASGATGGRGAVENBBDLP BLS 

DSCTBAAWEDKDDADI^HGKQQTPCLPCNRIiPTSAKBTPSHCKS 

KRQFNlDSMVHKHGIiKPYGYIKLINPXRLKNPTVBYMNSIYNW 

PWSKKBYIJCPVLEDDIJiIjQFDVBDL YKPVSVPES YPNGLS ENTS 

VVBKLKHMEARALSAKAAIARARBDLQKMKQPAQDFVMHT0VRT 

CSSSTSVXADLQKDBDGVYFSSYGHYGIHKBMLKDKIRTKSYRD 

FIYQ^HIPTa>K\n/li D VG C GTGILSMFAAKAGRKKVLGVDQSSI 








LYQAMDIXIUjNKliEIiriTLIKGKIBBVUUPV^VbVXlSBWMGY 

FLLFKSHLDSVLYAKNKYIARGG3VYPDICTISLVAVSDVNKHA 

DRIAPVDiJVYGPKMSCMKKAVXPKAVVKVLDPKTLISEPaSIKH 

IDCHrrSISDLRFSSDPTLKITHTSKCIAIAGYFDIYFKKliCHN 

rvvfstgpqststhwkqtvfllbkpfsvkagbalkgkvtvhknk 

KDPR8L7VTLTLNNSTQTYGLQ 


H 




™ IoSb 


RIiBPABWHX^GPCHAIliyiAW8iiRALGVHAAMRGAPljCQRV5#lP " 
SRTMLATGI PSS KVKYSRLSSTPDGY IPLQPKKTPPKI PYKA1A 
IATVLPLIGAPLX 1 IGSLLLSGYISKGGADRAVPVLl IGILVFL 
PGFYHLRXAYYASBGYRGYSYDDIPDPDD 


6440 


74 


553 




6449 


597 


1876 


KY^CKNl^UCLKIlXJVSCHDVYAKLIllHVRlflLGLWQPDrGPYO 
GLLNVWDGLFI IGWMYLPPHDPHVnDPMRPKPLFRI HLMKRKA 
ATVBCHYGHRGPHHGHIQrVKKDRPSTKCMQTDHHRMSGGRQKB 
FRTKLRKBWGRTLEDI FHEHHQKI1LMKFIYT3QYENCLTYRRI 
YLPPSRPDDIiXKPGLPKGTYGSHGLKTVNIiSFHGRRARGTmTG 
DPNIPAOQQTVBXDLRHRIQIiPDASNQRNFNBIiSRIVLBVRKRV 
RQKQQEGGHKAGBGRGRQGPRBSQPSPAQPRAKAPSKOPDGTPG 
HDGGBPGDAVAAAK0PAQCGQGQPFVLFVGVSSSNBDYPRTCRI1 
CFYGTGLIAGffi3PTSPBRTPGVPILPDBD3UPGFVWIjELKSPSI#Y 
SRVOATPRNADAPSPQAPDEMLKNIQSLTS 


6450 


Q40 


5?9 


FVPAPRTVSGKJifiLPGEWKEMl^KQRTCRKPSGNGGRAVKAAR 
MRLLCGLWLWI>3LIiKVX*0AQTPTPLPIiPPPMQSPQGNQFQGBWP 
VIX3I»AGNSPRPEHRALI2iIAFTA*rPBI>SODGHPBVWNAJfrRGQHC 
DTWS YVL1 PAAQPGQFTVDHRV Wl’RBQAGR PQDQPAGQHLVAAS 
RDAGPVHLPGQSSGP1G 






933 


H5PTPPTSPlLASTMKDVKI£FPSl*PQCKKIlABEWTyPMRREMQE 
rLPGLFLG VYSSAMKS ECLP VLQKHG ITUZ ICXSQNIBANF1 KPN 
PQQZtPRYIA/ZfDIAENPVENX IRPPPHTKBPZDGSLQKGGKVIjVB 
GNAGXSRSAAFVIAYlMBTFGMKSiaXAPAyVQBIUtFClNPIlAGP 
VHQLQEYKAXYIAKIjTIQMMSPLQXBRSt^VHSGTTGSIiKRTHK 
BBDPPGTMCF/ATAQNG 


6452 


1 


652 


RTRG BSSNMKPiAA YPLKCSG PHAKVPAVLLS IVLCiVrbPLU} 
IJCFIJCPICINSFYAFSVKDAIQGRTVBLKICYTQGKV5LVVNVASDCQ 
bTDRNYLGLKELHKBFGPSHFSVLAFPCNGFGBSBPRPSKBVKS 
PARKNYGVTFPIPHKIKILGSBCTPAFRPLVnSSKKKPRMNPHK 
YLVNPBGQVVKFVRPSKPIBVTRPDXAAIjVRQfVIIKKKKDI* 


1 


827 


223 


HRRWLPGLSMSPRKlT^RPI^IirtiSrxn^CliAAAIifeSAQSGSC 

ROKKNCKVVFSQQBLRKR1TPLQYHVTQBXGTBSAPBG5YZHHK 

OPGIYKCWOGTPLPXSETKFDSGSGWPSFHEFVINSRArrFTDD 

FSYGMHRVHTSCSQOGAHI/5HIPDDGPRPTGKRYCINSAALSPT 

PADSSGTABGGSGVASPAQADXABL 


6454 






HRRHLPGLSMSPRRTL>PRPXjSIjCLSX£LCXiCXiAAALGSAGSGSC 
RDSKNDCVVPSQQKLRKRbTPLOYHVTQBKGTKSAPEGKYTHHK 
DPGIYKCVVCGTPIiFKSRTKFDSGSGWPS FHDVUiSEAITPTDD 
FSYQ4HRVETSCSQCGAHLGHI PDOGPRPXGKRYCZNSAALSFT 
PADSSGTAEGGSGVASPAQADKABL 




1042 


173 


RVHLATVSASAAHRALGLPVRSHHQGSTRRMGVMTDVHRRFLQIj 
LMTHGVI^EHDVKRI jQTirCYKVHDRNATVDKLKDFINNXNSVIiE 
SLYXBI KRGVTBDDGRPI YALVNLATTS ISKMATOFAENBLnLF 
EKALKLI IDSKTGFASSTNI LNLVDQLKGKKMRKKBABQVIjQKP 
VQNKWUnBKH5KPTLHGRAII»EMEQ YlHBTYPnAVKI CBICHSL 






RPQSRSI3HWRHSLL0VSSGLHHLRVCAMVD 
TVBABAALQHKVVALyFAAARCAPSRDFTPLIfCDFYTALVABAR 
EPAPFBVVFVSAIXXSSQHiLDPMRKIjHGAHIAIiPPHDPTRHSLR 
KRYHVTAI PKLVIVKQlitSEVXTHKGRKQIRKRGLACPQDWVEAA 
DIFQNFSV 



PTTG FFVTN PPViNW PDG JCP PIWI LYVSKLKKIIHFPDFDXKI PV 
JO^LPIdiYVaraSGLSSTSKLSLPMPTVIiRKFTrPLTLLIiBT 
IIl/»KPYSXtgIILgVFAXIbGAFlAAGSDIiRPWLEGyiPV FLtn) 
I FTAAHGVyTTCQIWDPKKLGKYGVLFYRACPMI IPTLI ISVSTC 
DLQQATBPNQWKKVVFIIXJFIJ^CFWSFljIilYSTVlCSYfNSAL 

tiavvgaiknvsvayigiliggdyifsixnfvglnicmagglry 

SFLTLSSOLKPKPVGKKHICUJLKS 



piTCFPVfMPPWNWPIXJKPPIMILYVSKIina I HFPDFDKKIPV 
i rf.V PT.OTJ.VV nKUTg^^ CTggT^qT.PMPTVIJt^fleLTIJitjBT 
IILGKQYSLRI t LSVFAI ILGAFIAAGSDLAFNLKGYI FVFI21D 
IPTAANG\rrraQKMDPKBIGRYGVLFYN7\CPMI IPTLI ISVSTG 
DLQQATBFNQWIDHVVFILQPIiIiSCFLGPIiLHYSTVLCSYYNSM* 
TmVVGAIKHV5VAyiGlUGGDYIPSLL»FVGLNICMAGGLRY 
SFLTLSSQLKPKPVGBBKl CU3LKS 



KLFPLPLLYV(3IHISGI>5STSKLSLPMFTVLRKFTIPLTIaLIjBT 
IXUGKOYSLNI ILSVFAI I LGAPIAAGSbLAFHLBGYIFVPIAlD 
IPTAAHGVYTKQKMDPKBUJKYOVIiFYNACTMIIPTIjIISVSTG 
DLQQTVTBFliQKKHWPJliQFIiLSCFbGFIaliKYSTVlXSYYBSAL 
TTAVVGAIKNVSVAYIGILIGGDYI FSLljNPVGLHlCMftGGLRY 
SFLTLSSOLKPKPVGKBWI CLDLKS 



PTTGPPVTNPPWHWPDGKP PIHIlYVSKLNKJL IHFPDFDKXI PV 
KLFPLPLLYVGNHISGI*SSTS KLSLPMFTVLRKFTIPLTLLLBT 
IILGZQYSUaiLSVFAIILGAPIAAGSDLAFHLEGYIFVFU?D 
IFTAftHGVYTKQKHDPKKLGKYGVLFYNACFMIXPTLIISVSTO 
DLQQATBPN^fKHVVFILQFIiLSCFLGFIJU'iySTVIiCSYYBSAL 
TTAVVGAIKHVSVASTGIIjIGGDYIPSLLNFVGIiHICMAGGLaY 
SFLTLSSQLKPKPVUKBM I CLDLKS 



UlQHTIJUTAVGQTHPlAWMAWKPSIXiAFYGPASFITFVNCJIYF 
LSIFIQUCRHPKRICYKtiKKPTEEQQRIAANBNGBXNHQDSMSLS 
USTSAI^UBHTPHSQXjLGASLTLLLYVALHMFGALAVSLYYPI, 
DLVFSFVFGATSLSPSAFFVVHHCVMRKDVItLAH 1MTCCPGRS3 
YSVaVNVQPPN3N0TNQKAPKCPKS3AKSSClNK5ASSFKHSSQ 
GCKLTTJIXiAAAAQCHAKSLPIJJSTPQIjnNSLTKHSMDNDIKMHV 
APLBVQFRTHVHSSRHHKNRSKIfflRASRLTVLRBYAYDVPTSVE 
GS VQNGLP KSRIX3WEGHSRS RRAYI^AYREROYNP PQQDSSUAC 






ffiiGPlKSNGQEGPLU^il^’ltiHVRTGLWKKHTTV 



SBKLDRKKKLKBDS PRKTPNKKSGVPSLPVSLTS IKBEPKBAKH 
PIJSOSKEKSKI^NDDRKTPVNHBDSRGTRVAVSSPMSOHQSYIQ 
YLHAYPYPQMYDPSHPAYRAVS PVLHHS YPGAYLSPGFHYPVYG 
KMBGRKBTBKVHTS PSVNTKTTTKSKALDIJjQQHRKQYRSKS pa 
PVEKAXRERBRRAERBRDRHSPFGC yHLHTlIHHIHVGMGYPLIP 
GQYDP FQGLTSAALVASQQVAAQA3ASGM FPGQRRB 



VILCIIGGWIFKHADRSMBKKKDEPRTRAEARPWVDBDLKDSSD 
LHQAKBDADBHQBSBENVEHI PPSHNHYPKKEMVKRSQBFYKLI. 
HKRRSVRFISHEQVPMBVI13MVTR!TAGI» 



GlLRQKESBBRNRIHKKBXLFIiBHKLWPSEMSSLSG 



LVBPSKLGRTLTHKHLAHTFIXXYCPPPPCQBAISKBPI 
YWIQKNAYSHKEKLQLHQHTBAIKEELLY pkakgggalventtt 

GISRDTQTIiKRLABBTGVHIISGAGFYVDATHSSBTRAMSVBQL 
TDVLMNB HjEGADGTS IKDBI IGBXGCSWPLTESKRKVLQATAH 
AQAQLGCPVIIHPGRSSRAPPQIIRILQHAGAOXflKTVHSHLDR 
TI LDKKBMiSFAQUKYX^YTOjSOTBIjlflYQKSPDIlJMPDDNKR 
IRBVRLLVEEGCEDRX LVTUffllHTKTRLHKYGGKGYSH I LTNVV 
PKMLLRGITBNVLDKILXENPKQWLTPK 



gmVFFKTLRNHNKKTTAGLCliLTWGGHiCL YGKjtCTyLI .RRAAC 
OKAQVFGHOLTPPNACfVKKATVFljlJPAACiOGKARTLFKKNAAPI 
LHLSGMDVT I VKTDYBGQAKKLIJaMBmTJVl IVAGGDGTLQBV 
VTGVLRRTDKAXFSKIPIGFIPIjGKTSSLSHTLFAESGNKVQHI 
TDATIAXVIDGBTVPIXfVljQrKaBHEQPVFAffrGLRSfGS PRDAGV 
KVSKYHYLBPLKI KAAHFF9TLKBWPQTIKQAS ISYTGPTBRPPN 
BPEBTPVQHPSLYRfiXLRRLASYHAQPQnALSQBVSPBVWKlJVQ 
I^TIKLSlTTRKNQIiDPTSKKDFLIflCIKPDTISKErDPITIGaR 
KVBHPKIiHVBGTECLQASQCTIiLIPEGAGGSPSIDSBBYEAMPV 
BVELLPRKLQFFCDPRKSEQHLTSPTQ 



vabgtelsqlkkahppadhgrrkskrbcpppkkkhtgtlbtq PTC 

PPCHHKKSCDVKMDHARNTGVISOTVCLKEPQTPITYL3BPVDV 

YSDHIDACBAANQ 



GBLRVLALAHGKLACHAVLTASLLSLRSRLMDSDHDy KR PNVBT 
I KCVV^jDHAVGKTKLI CARACNATLTQY QLLATHVPTVWAIDQ 
YRVCQBVIiKRSRDVVDDVSVSIiRIiIIDTPGDHHKDRRFAYGRSDV 
\AnXPSIANPHSLHHVKTMWYPKlKHFCPRAPVlLVGCOLDLRY 
ADLEAVNRARJIPLARPXKPNBILPPBRGRBVAKELGXPYYBTSV 
VAQFGIKIArPDKMRAALXaRiatliQFWCSHLRNVQRPIjliQAPFTj 
PPKPPPPIIWPDPPSSSBECPAHIiLBDPLCADVIIjVLQSRVRI 
FAHKIYLSTSSSKFYDLFLMDLSBGKLGGPSBPGGTHPEPHQGH 
SOQHHHHHHHHHGRDFLLRAASFOVCBSVDKAGGSGPAGIiRAST 
SDGILRGNGTGYUO?GHVLSSHSfiAFVSIQBEMABDPLTYKSR 
WVVVKIIOSSIQPGPPRAVLICY LYTGH LDBHKBK^HlAHIAgL 
LKVFDLRMMVABlLKWBAPMNQBTTKAFHVItRTNttVKECIARGT 
FSDVTF I LDDGTISAHKPLLX SSCDWKAAMFGG P FVB3STRKW 
FPVTS JCSCKRAVLEYL YTGWFTSSPDLDDMCI*I ILAKRLCLPHL 
V7VLTKQYTVTGLMBATQMMVDIIX3im*VFLBLAQFErCAYQLADH 
CLHHICTNnWVCRKFPRIJHKRMSPEHQBYFBKHttWPPVWYIjKB 

bdhyqewrkbrbkbdylhlkrqpkrhwlfwispsspsssaasss 

SPSSSSAW 



DAMAQTMMAALAPVGSPASHGPRLAAGL!tLLPt l 1I«GLI<Or JABPG 
LGRVHHIJUiKDirVRHKVHLHTPGPFKDGYMVVNVSSLSLNB PBI) 
KDVTIGPSlllHYXHDGFSSYIJJBDVNYCXLKKQSVSVTLLXIiDI 
SRSBVRVKSPPBAGTQIiPKI IPSRDBKVLGQSQBPNVNPASAGH 
QYQKTQlOGtSKSKRSTVDSHAMGBKSFSVHHBGGAVSPQFFFNIS 
tDDQBGLYSLYEHJCCLGKBLPSDKPTFSIiDIKITBKMPDSYLSA 
GBIPIJKLYTatAFFFFIiSGTIKIHIIiRKRRNDV7KIHWI/4AAL 

pptkslslvfhaioyhyissqgfpibgwvvvyyithllkgallf 

ITIALIGTOWAPIKHILSDJCDXKIFMIVIPRRVIANVAYIIIBS 

tbbgttbyglhkdsi^vdllcogailfpvvwsiehlqbasatd 

GKGKPSRAHFVLL3LL 



DAHnGTNMAAIAPVGSPASSGPRIAAGLHJbLPMLGLLQLLAKPG 
LGRVHHIiAIilCDDVRHKVHLHTPGFPKDGYWVVNVSSLSLNEPBD 
iOArriGFSLD RTK BIX3F S SYIiDBDVHYCXIXgQSVSVTLLILDI 
SRSKVRVKS PPKAGTQLPKI IFSRDEKVLGQSQB PNVNPASAGN 
QTQKTQDGGKSgRSTVDSKAMGEKSFSVHHBGGAVSFQFFFIJIS 
TDDQBGLYSLYFHKCbGKHLPSDKFTFSLDI KITBKNPDS Y1>SA 
GBIPLPKbYISHAPFFFLSGTl MXHJUjAKRH^VFJOHWJ^IAAIj 
PPTK 3LSLVFHAIDYHYTSSQGPPIBGWAVVYYXIHLLK3GALLF 
ITIAI^GTGHAPIKHILSDKDXKIPmVIPHRVLANVASTIIBS 
TBKGTTBYGUffKDSLPLVOLLCCGAlLFPVVHSIRHLQBASAID 
CTtGgPgRAWPVT.I^T.I. 


1 


2726 


1437 


AAASGVS3RADAPVLAQSPASAQBGRPSTPRVPGSRHHPSAPRS 

GPLPREDGCRTPGPQLIJ’LPGALLRPRTLLSSAABTGILSRHFDT 

qhpssggrcrggtbspssaagrpasmaeaerdchsdtvradddb 
bhespabtdloaqlqmpraghnfblapgvsssnlbne pcraaeg 

SLQKTSADTKGKOBQAKKKKARBLPLKAVBBBQNGALYKAIKPY 

RRAMQLVPDIBPKITYTRSPDGDGVGNSyiKraiDDDSKMADLLS 

YPQQQLTFQBSVUCLCQPBLESSQIHISVLFMBVIlfflFItNWS 

SDIJ>ZJlSI*KQL^VCKGFVriCAlU?PBINRIACLKWGR5CIKLV 

PTTTSWREMFLKHPHVRFDGVYISKTTyiRQGBQSUXJFyRAWHQ 

VByYHYIRPPPDGHVMMLTTPBBP03lVPRliB.TR 


6471 


1750 


293 


PPPbKMAAGGSGVGGKRSSKSDADSGPLGIiRPTSVDPAIAJLRRH 

GPRNKKRGWRRIAQBPLGI^VDQPliBDVRLQBHTSGGLLSEAPN 

BKLPPVDTOSKBKGLTIXRTKVQXKGtJJtKKPlittVDLlLBMTSK 

VPAPKDVIAHQVPNAKKLRRKBQIMKKLARQGKLP'RBVRRAQAR 

LLNPSATHAKPGPQDTVKRPPYDLilAfiENPLDRPLVGQDBPFLB 

CTKKKGVKRPARLHTXP SQAPAVEVAPAGA3YHP9 PRDHQTLLS 

AAHBVKDQRQKKABKLKRWALPATBOAATQBSTFOBLCBGIiLB 

bspgeqepqqgbgpbagdahvcptfariattbkkikqqrrrbka 

VHEIiRVQQAALRAAlUiRHQKLFRLRG I KAQVALRIAELARRQRK 
RQ3\RRBABADKPRRLGRLKYQAPDIDVQLSSKLTDSLRTLXPBG 
NILRDRFKSPQRRNMIBPRERAKPKRKyKVKLVBKRAFHBIQL 




3 

| 


897 


SOGSDRAQWAMBPPFDVDALFPBRITVLDQHIiRPPARRPQTTTP 
ARVDLQQQINTI IXJBLGKASAKAQNLSAPrrSASRMQSNRHVVY 
IIlOJSSARPAGKGAIIGPIKVGYKKIiPVLDDRBAHNEVBPLCrL 
DPYIHE3VQRHOHGRKLFQYMLQKSRVBPHQIAIDRPSOXLLKP 
UJKHYNLBTTVPOVNNFVI FEGFFARQHRPPAPSLRATRRSRAA 
AVDPTPAAPARXLPPKRABGDIKPYSSSDRBPLKVAVBPPHPLM 
raprraxppahppprssslgnspkrgplrpfvp 


6473 


22 


912 


SSAVBFVWHGKKMAAE^NXTKIQTLPKRliRAVPTNRAi!Fl)CGAX 
NPSMASITYGVFixrCDCSGVHRSI/JVHLSPIRSTBLDSNMNHPQ 
UaOQVGGNAlttTAFPRQHGCTANDANTKXKSRAAQMYRKKIRQ 
IX33AALARHGTDLWIDNM3SAVPHH3PBKKDSDF FTKBTQPPAH 
DAPATEPSGTQQPAPSTBSSGLAQPBBGPNTDLLGTS PKASLEL 
KSSI IGKKKPAAAKKGLGAKKGLBAQKVSSQS PSBIBBQAQVAB 
KLRBQQAAnAKKQAEBSMVASMRLAYOBLQIBR 


6474 


3 


462 


LCU^Q^AAAPAVPVlK^PCFTDi VIMPKRKJ3 PKNTBG KDGS 
FCVTKQB PTRRSARLSAKPAPP KPKPKPRKTSAKKE POAKISRGA 
KGKXBKKQEAGKEGTAPSENGETKAEBIHI SRSTVNVSTSRGTP 
1 PSTLSVKGQIBTVRVKGTRN 


6475 


3 






6476 


106 


1090 


aramaqykgtmrragrahhllkkrkrqreqmbvltoriabbtil 

KSQyDKRFSAHYDAVBAELKSSTVGLVTLOTJMKARQRALVRERE 
RQIiAKROHIiKEQRLQQERQRKQKQRRHRKRKISCLSPAIjDDLIID 
QADAAEARRAO^LGKNPDVDTS FLPDRDRKRKENRltREELRQBW 

baqrbkvkdbembvtfsytogsghhrtvrvrxkntvqqplkkai, 
qglrkdplelrsagveqlmfi kkdlilphyhtfydpi iarargk 

SGPLFSFDVHDDVHLLSDATMBKDBSHAGKVVIdlSHYBKNKHI P 

pasrnbaydpkrkwdkytir 



509 


6477 


227 


915 


LQGHLMGmAASRPLSRFWBHGXZiXVCVGRHYADUVREMRSAVl!* 
SBPVLPLKPSTAMJPEGSPXI/IPAYTHNIiHHBLELGVVMGKRCR 
AVPEAAAMDYVGGYALCI281TAHDVQDECKKKGLPWTLAKSPXA 
SCPVSAPVPXKJCTPDPinCLKLHLKVNGBIiRQEGBTSSMIPBIPY 
1 ISYVSK1 ITLBEGDZ ZLTBTPBSVGPVKBHDBZSAOZBGLVSM 
TFKVEKPHY 


6478 


2 


— 145s 


FVSSlULPBSLAS^BASTLBAKGRKKBDDCSSWKKOrmXRKIF - 

I PMEVLGSGAPSHVFIiVKQRI^TCKLFALKCI KK5PAFRDSSLEK 

BlAVIJaaXHEHXVTt^IYBSTTHYYIiVHOLVSGGELFDRILE 

RGVYTgKDASLVXQQfVI*SAVKYLHBHGlVHRPI*KPBMLLYI*TPB 

EHSKIMITDFGI^KMEaNGIMSTACGTPGXVAPEVLAQKPYSKA 

VDCWSIGVITYILLCGYPPFYEBTBSKLFEKIKBGYYBPBSPPW 

ddisksakdpicslwkdpnbrytcbkalshpwidgntamrdi 

YPSVSLQIQKNFAKSJOmOAPHAAAVVHHMRKLHMNLHSPGVRP 

BVKHRPPBTQASBTSRPSSPErrrrBAPVLDHSVALPALTQLPC 

QHGERPTAPGGRStiKCLVNGSLHISSSIjVFMHQGSLAAGPCGCC 

SSCUnGSKGKSSYCSBPTXJJQCANXKQHPKSKVHVPVXASGSS 

HCRAGQTGWCLIH 


6479 


3 


949 


5U3dCPC»mPAGG0AGAMRlJ^AI^LGEIJU*S PSRVPbFPVPDJUS ~ 
YPIVS ILYLKYBPGAVFL3RElIPIA£HIiCAMliHCFG3YIIiADLl» 
LGBPLIDYPSHNSSIUiASAVWYia PPCPLDLPYKCVCFIiPVKL 
I FVAMKEWRVRKIAVG XHHAHHHYHHGWFVNIATGWVB3SGVA 
U4SWFEQIiLRGWKPBTNBIIjHMSFPTKASI»YGAILPTLQQrnW 
LPVSKASLlPIPTLPMVSCKVPLTATHSHSSPFDAIiEGYICP'VL 

fgsaoggdhhhdnhggshsgggpgaqhsampakskkei>segsrk 

KKAXXAD 


6480 


192 


514 


DFHSXYPPniCPOYljRSAKMTBVMWNTQ PMBBIGIiSPRKDGLS Y 
QXPPDPSDPDRCCKbKDRLPS IWBPTEGEVESGKLRKPPBEFI* 
VQBDBQDNCEETAKENKEQ 




110 


1131 


I^RMDLDVVPMPVXAGGTIiAI PXIAFVASFLLMPSAltlRXYYWY 
WRRTIjGMQfVRYVHHKDYQPCYS FRGRPGHKPS XLMLHGFSAHKD 
MWL3VVXFbP^NLHLVCVBMPGHBGTTRSSLDDI*S IDGOVKRXH 
QFVECIXLNKKPPHLVnSTSMGGQVTVGVYAAYYPSDVSSWJLVCp 
AGLQYSTDNQPVQRLKEIiQGSAAVEKlPJOIPSTPEBMSBMLQLC 
SYVRFKVPQQILQGlVDVRlPHNNFYRKLPbBIVSBKSRYSLHQ 
HHDKIKVPTQIIHGKQDQVIJJVSQ\DMLAKSIArTCQVKt»IiErfOG 
HSVVMBRPRKIAKLI IDFLASVHNTUtUf KALD 


6482 


2517 


563 


BPVSkvSQSRRKAGVPTANlBHSGAVBAAMANVPHAEVCBKFQA~ 

AIALSRVELHKNPBKEPYKSKYSARAIiLEEVKALLGPAPEDEDE 

RPEAEDGPGAGDHALGliPAEWB PEG PVAQHAVRLAVXBFHLGV 

HHIDTEKl>SAflBEHLVECLRT*T»RRYRl*3HPCI3LCIQAQaOBUgi 

lwserbbietaqaylessealthoymkevosppldptkrflpbe 

BKLTEQHRSKRPBKSnfTHNLYYl»AQVYQHLBMFBKAAHYCHSTL 

B31QLBHNAYHPIEWAINAAXLSQFYINKLCFMKARHCLSAANVI 

PGQTOKISATEDTPBAEGHVPKLYHCSIKGEIARCWIKYCLrLMQ 

ftAQLSMQDNIGBIJJIjDKOSBIiRALRKKELDREBSlRKKAVQFGT 

GKXjCDAISAVBgKVSYLRPLDPKKARHLFIiLGQHYVFRAKBFFQ 

IIXJYVTDHIEVVQDHSALFKGIAPFETDMEHfiCKMHKRBIAMLE 

PLTVDLHPQYHiLVN3^IQFBIAHAYYI24MDLKVAIADRIiRDPD 

SHI VKKIl^KIiNKSALKYYQIjFLDSDRDPNKVFPEHIGKIJVIJlPA 

MLAKPRVAHLYOdlTADPKKKLEHIiAXSIiKHYXFIVDYCEKHP 




KAAQHIBVBLBXjSKETIVSLLPTKWBEPRTKMALT 
NSHTiT .TTCT.R RRAPLSANGRBABAMBQRL^BFHAARKRAGLAAQP 
PAASQGAQTPGKKABAAATLKAAPGWLKRFLVWKPRPASAHAQP 
GLVCBAAQPQGSrSBTPHNTAIPLPSCMQSFLTNITFIiKVLr^ 
LVLLGUVBLBFGLAYPVL3LPYWMYVGTR0PBBKKE3BK3AY5 



5 


6404 


201 




QLAVKXKMSGLRPGTIXJVDPKIKLFVKAGSDGSSIGHCPFCQRLF 

DPIKTBBPIiBQTIAPPRYPSLSPKYKBSPDVGQJLFAKPSAYIK 
HTQKBANKHPBKSLLHB PKRLWJYLiJTPIiLDEIDPDSAKBP PVS 
RHLFLIX^QLTTADCSLLPKLHI IKVAAKKYRDFD I PAE PSGVW 
RYIiHNAYAREKFTOTCPKDKBIKIJTYANVAKQKS 


6405 


6 


1091 


PVPLVRAVSFLPCPDaOXLKKKnQSSEBSHGSNSMRSILggDKE 
DKBPPKVU.YHRPftSPEVGMLVWHKHKKYPFWPAWESVRQRDK 
KASVLYI BGHMNPKMKG FTVSLKSIJCHFDCXRKXniJ^ 

HQJ) IGWCVSL ITDYHVRLGOGS FRGS PUffYYAADIS YPVRKS IQ 
QDVLGTKLPQLSKGS PEXPVVGCPLGQRQPCRKMLFD8SRAARD 
RANQKLVgYIGKAKCAgSHIiRAILKSRKPSRWLarrFLSSSQYVT 
CVBTYIiBDBGQLDLVVKYIjQGVYQHVGSVKVLQRTNGDRIRFrLD 
VLLPEAIICAISAGDKVDYKTABKKYIKGPSLSYRKKEI PDNQL 

LBSKNRRRR 


6406 


10 


“ 501 


LVI^AGGA^P^VTQGIYYMlAFSBHPKPPDyaBLSIJSLTLA 
GGT(3lPSGPLHRAHHroiNFRQRHGHIGVGLYLIjASAAAFYYVFB 
T B>nTNPTAIJ^rOQfHPmPIJ<trriVlliSLKAQIiL3LPFWVWTV 
IPLVPYUQMFI»PLYSCTRADPKTV13 YCIXPICIiAVICMRHQAFV 
KASKQISRLQLIDT 


6487 


352 


863 


SPLlPLKGKl^SVTLHTIJVGDIjQBViXKRTPKTCENFLALCASN 
YYNGCIFHRMI KGFWVQlTiD PTGTGRGGNS IWGKKPHDBYSKYL 

khwvrgwsmahhgpntkgsqppitygkqphldmkytvpgkvxd 

GLETLDELBKLPVNSRTYRPLNDVHI KD1TIHANPPAQ 


6488 


070 


241 


TALQB PGTSG P PLSmPALPSGTP0RFKPI*PGARGPSWPP8 PRVP 
MBPPNLYPVKLYVYDLSKGLARRIiSPIMLGKQliBGIHHTSIWH 
KDBPPPGSGGISSCPPGGTIiLaPPDSYVDVGSTEVTHKIPLEYL 
SSLGHSLFRGBAINLPBHHCNTFSNBVAQFLTGRiaPSYITDLP 
SEVLSTPPGQALRPIiLDSIQIQPPGGSSVGRPMGQS 


6409 


1457 


375 


KVAKMAT^IiSBBBLb^BDYY SliNAIRREASBEKLKAAYRltLCML ‘ 
YHPDKHRDPSLSSQAEQUiFHIiVBOAYBVLSDPOTRAlYDI YGKR 

glbmbghbwbrrrtpabirbbfhrixsibkbbbrlqqrthpkgt 
isvgvdatdlfdrydbbyedvsgssppqibihkkhisqs I KAPL 

TATPTAI LSG3LSTQNGNGGG3 INFALRRVTSAKGWGKLEPGAG 
DIXX^PLFGLKLFRHIiTPRCFVTTHCALQPSSIlGIRPGLTTVlAR 
OT.nywrvr^^ilwr^CPIJi3VnRPHRMTRACAPBP5PRPPLaVP 

thuakcsgartpstawtsaavkimaclsgpgsgshqlllltpr 

skrrtggg 


6490 


3 


1183 


USAGCBVWLGYGPRAAAAAAATVIJ^GGAGPTBTMFVARS IAADH 
KDLIHDVSPDPHGRBMATCSSDOSVKVWDKSBSGDNHCIASWKr 
HSGSVHRVTMAHPBFGOfVIiASCSFDMTUVVWKBIVCBSWDKLRG 
QSHHVKRnLVDSRTSVTDVKPARSHMGmLATC9ADGIVRIY2 
APDVMHLSQHSUZHBISCKL5CSCISVHP5SSRAHSPMIAVGSD 
DSSPHAMAKVOlPBYSraTTREYAKABTlHTVTDPVHDIAPAPHL 
GHSPHILAIATKDVRIFTUCPVRKBLTS8GGPTKPBIHIVAQFD 
HHNSCyVWRVSWNITGTVLASSGDDQCVRLJfKANYMDNWKCTGIL 
Kt35GSPVHGS6QQGT6HP3LGSHIPSI^JHSLHG33AGRKHS 


6491 


3 


1183 


HSAGCBVHLGYGPRAAAAAAAIVIiFGGAGPTBTHFVARSIAADH 

KDLIHIWSFDFHGWRMATCSSDQSVKVtfIJKSESGDWHCrASHKT 

HSGSVWinrrHAHPBFGQfVLASCSPDRTAAVHKBIVGBSPlDKLRG 

QSHWKHTTLVDSRTSWTDVKPAPBHMGIiHLATCSADGIVRIYB 

APDVMNX»SQWSLQHEISCXLSCSCISWHPSSSRAH5PMXAVGSD 

DSSPNAMAKVQIFBYNKNTRXYARABTI^trVTDPVHDIAEAPNL 

QHSFHIIAIATHDVHIPTIiKPVRKBLXSSGGPTKFBIHIVAOFD 

HHHSQVHRVSWHITGTVIASSGDlXX^/RLWXANYMDNH30CrGIL 



5 



IPFlJtSCOCCCLiFDFPPPPIjDQVQEKBCSVSRVTEHGTPKPFRK 
PnSVAFGBSC^EDEQPKNDLBTDPPHKQQLVSRKVIXGIiKPCSl 
KRQKVllffiLFYTSRAHVRrLKVLDQVFYQaVSREOIIiS PS KLRK 
I PSSljHJILOLHIGUJBQHKAVRKENBTSVXDQrGEDLUHPSG 
PGSBKLKHAftATPCSHQPFAIiEMXKERQXKDSRPQfrFVQtlABSN 
PX^RIiQIiKPIIPTpfORLTKyPIiliLOCTIKTYIEWPTB R Ky V KK 
AADHCRQILNYVHQAVKKABNKQRIiEDyQRRIjDTSSIJCLS SYPN 
VBSLRNLDLTCHKNIHBGPLVHKVNSDECTZDLYTLLLBOrLVtiL 
QKQDIlHI>VLRC^BQl»ASTW>SKHT , FSPVXKLSTVLVRQV7^rDN 
KAliPVISMSDNGAQIYBLVnQTVSBXrVffQDLlCRMAASVKEQS 
TKPIPLPQSTPGBGDNDBEDPSKLKHEQHCSISVTGLQSPDRDLO 
LBSTLlSSKPQSUSLSTSOXSBVRDLFVAHtQFAKSQRTDGTCiK 
BVOKDYQXAI PBSHLpVSEERWOJlAIiRNL^^ 
EKSVQBDHQUPPRYHTASCXaPGfrDSVXQNSBNlKAYHSGEGHMP 
FfiTGTCDIATCYSPRTSTBSFAPRDSVGIAPQDSQASNTLVMDa 
MD4TPEMPTMSPKGGLDnSGKHFFnAREflHSDKWPSBGDGAVHK 
KKKDVNLRIS42JTLHjDGYDPVOESSTDKEY7^SSLTLQPHTGIP 

avbsthqqqhspcotksdgai spftpeflvqqhwqahbyscfbi 

QSPSCCADS^JSQIMKYIHKXKADLBHIiKKVKESTmLOCRLAGS 

ALTDKH3DKS 



TPABWAYQG3STSDCMSKTLDSASAHFAA3AVVSAPVPSRSKVA 
KEQpSTGHNNINGVVQPSGTSKTLYSTNHALSSSPGISAVQLVRr 
VGHTTTHHLI PALCTSSPQTLPMNNS CLTHAVHLNNVSVV5 PVH 
VHINTRTSAPSPTALKIATVAASMDRVPKVTPSSAIS8IAHBNH 
KPKHLGLKGXABTTVAHKVT 



AVAGGARPCSTPSS PHRRCRRHRPR PLPRPPAAIKSASAVYVLD 
LKGKVLICRHYRODVDMSBVEHFHPIUffiKEEBCMLSPIliAHGG 
VRFMW I KHNNLYLVAXSKKNACVSI^VFS FLYKWQVFS BY FKJKL 
BEES IBXQTFVlIYELU^BIMDFGYP'QTTnSKILQBYXTQBGHKL 
BTGAPRPPATVTHAVSWRSBGIKYRKNBVPLDVIBSVNLLVSAH 
GNVLRSBIVGSIKMRVFIiSGtPBIJUjGIiKDKVLFlJWrGRCJKSKS 
VBLEDVKFROC^BLSRFBHDRTISPIPPIIGBFBtMSYRUrrHVK 
PLIWIBSVIKKHSHSRIBYNIKAKfiQFKHRSTANNVBIHIPVPW 
DADtSPKFKTTVCSVXHVFBKSBIVnSIlCSFPGGKBYljMRAKFQL 
PSVEAEDXBGKPPISVXFBl P YFTTSGI QVRYLKI I BlLSGTfQAL 
PNVRY ITQNGDYQLRTQ 



AV7U3GASPCSTPS5PHRRCRRHR PRPLPRPPAAIKSA3A 

LKGXVLldtHYRGDVtttSBVBRFMPUMSKEBBGKliSPILABOG 

VRPlWlKHIOn»YLVArSKKK7VCVSLVF9FLYKVVg V F SgY F XKL 

BBBS nag lF Vl I Y B L LDgtMDFgYPQOTDSKILOBYITOBOHKL 

BTGAPRPPATVTWVSWRSBOIKYRKHEVFLDVIBSVNIiVSAH 

CTVIJlSBXVG3IKMRVFL3GMPKLRLGLliDKVLgPSTGaSRSKS 

VKLBD\nCFHQCVRI^HFSOTmTISFIPPDGBFKU1SYRLNTHVK 

PLIWXBSVIBKHSHSRJCBYMIKAKSQFKRRSTANNVBIHIPVPM 

DAD SPKF KTT V GS VKWVPB HSHIVWgXKSPPGGKBYmRAHPGL 

PSVEABDKB0KPPISVKFKIPYFTTSG1QVRYLKIIBK5GYQAL 

PWVRYTTCNGDYQLRTQ 




AKTQICRuCPRRHLHPPOGAKMGNGTEKDYNFVFKWLIGBSGV 
GKMJ*SRFTRNBFSHDSHTTIOVKFSTRTVMIiGYAAVKAQIWD 
TAGLERYRAI TSAYYPGAVGALLVFDLTEHCyrYAWER WLKBLY 
DHAEAT rVVHLVGNKSDLSOARBVPTBKAHMFAENWGLIiFIiSTS 
ALDSTNVBIAFBTVLBBIFMCVSKQRQNSIRTRAITIjGSAQAQO 
SLRLCPWCTIHljRGFrTTUUjSSLLJUjljRPAlAPL IIXjLS
IJtVSHIQGBGKDPCVBAVCTRGGPaWPnSRAHlIJOSDEDFKPRI 
VPTyRDPNKPYKKVLaTRYIQrrKLGSRBHLLVAVI/rSEArLSTL 
AVAVNRTVAHHFPRLLYPTC^KGAHAPftGNQVVSlfi^ERPAWLM 
SBXlJQIIinHFGADYIWFPIMQDDTYVUAPRLAMiAGHLSINQI) 
L^GRRKBFIGWjEQMYCBGQPGYIJjSRSI»LIAIJIPHIjDGCRG 
DIL3ARP®BWLGRCI*1DS LGVGCVSQHQGQQYRSFEIAKNRDra 
KBGSSAFI>SAFRVHPVSgGTI>TYRLHKRFSM»g LgRA YSB IBQI» 
oaqirnltvltpegbaglswpvglpapptthsrpbvlgwdyptb 
OHTFSCADGAPKCPI.<)GASHADVGDAI£TM»BQI»NRRYQPR l^y 
QKQ fPT J Jan vnpypPABgaflSYTtPIJiLKCVrCtftGHRRALRRRVSL 
LRPl*SRVp 1LPMPYVTHATRVQI»VLPI>LV7\BAAAAPAFLBAFAA 
NVLBPREHALLTLLLVYGPRBGGaGAPDPFWJVKAAAAm^RY 

PGTRIJUCIAVRAEAPSQVRLrtDVVS KKHPVDrn*PFliTTVWT^ 
PHVI^Cra^MMSGHO^^W^SFNPAliSPQRSPPGPPGAGP 
DPPSPPGRDPSRGAPIGGRFDRQASAEGCFYHADYlAAfiARLBG 
2LRGQSBKERLBGLBVMDVFLRPS<UJnjFRAVBPGI»VQKFSIjRD 
CSPRLSBKLYHRCRLSNLBGXjGGRAQLAKALFKQKQftKST 



AGNKPDPWAGflNRTAVLPDVSVFHREDVGWWRSWLQQSYOAVKK 

KSSEAliBPMKHDI*TBPTQVVQHDTACTlAATftSVVKBKIiATEGS 

SGATKKNK}03LSDFljGVISDTFAP3PDKTII>CDVITIiMGTPSG'- 

AEPYDQTKAiUiYSLiQSDPRTYCHKPDGPPKLF DAWLSQ FCLgKK 

KG5XSBLLVGSPS IRALYTKMVPAAVSHSBFWHRYFYKVHQLBQ - 

gQ^RKDAlJCQRAEQSISHEPGWKKKKKELfCISPJSPZBAKVPV 

akistppegepgpqspcsqilvtsvkppaevtpsbssesisi*vt 

qxanpatapbarvi^kdi^kij^aslhbqgiavdvgetgpspp 





I HS KPLTPAG3TGG PEP RPPAHVKTIjRKBAPTDLR VFKLNSBSG 
KSrPSNHGKKGSSTDlSSDHEKDPDLDMTEKKVQMALSKVnASG 
KVSGPGGSKGSKPNGPGCKSSPQPAOLSPQEGPCSCLS 



AGNXPDPWAfifiNttTA VIiHJVSVFHHJSDVQWMRSWIjQQS YQAVKB 

ksshalkfmxrdltbptqvvcjhdtactxaatasvvkbkiategs 

SGATBKHKKGLSDP1 X V ISl )TP A P3PbttraPCPVlTLMGTPSCT 

AEPTDGTKARLYSLQSDPATyCBEPDGPPELFDAIVLSQFCliSBK 

KGB1SKLLVBSPSXRAL1TKMVPA&VSHSEFHHRYFYKVHQLH0 

BQARRDAL1QQRAE0SISBBPGWKBBBKKI>GISPISPKEAKVPV 

aictstffbgbpgpospckkhlvtsveppabvtpskssesislvt 

QIAHPATAPBAHVLPKDLSQXLLEASLKEQGIAVDVGBTGPSPP 
IHSKPLTPAGHTGGPBPRPPAHVBTl>RBKAPTDLRVFELNSrxSO 
KSTPSNNGKKGSSTDI fiBOHBKDFDLCMTBBBVQMMtSKVDASG 
BVSGP0GSB6SBSSQFGCBSSPQPAQLSPQB9PCSCLR 



GKVCWAHWVCLSILS PPPW3OTPHAQBABGQQTRAAAGRAT0 
SANMTKKKVSQKKQRGRPSSQPCRHXVGCRISHGWKHGDBPITO 
WKGTVUXJVPIHPSLYLVKYDQIDCVYG«I£LHEDBRVI»S1iKHjS 
DRVASSHISDANLAtJTI IGKAVBHMFEGKRGS 1CDKHRGHVLAQA 
PlMXANPYXTYBKDPVIiYMTQLLDDYKBGDLRIMPBSSBSPPTB 
RBPGGVVDGIJQXHVBVTKBDGSKaiawIHQVKAKPSVyPIKP 
DDDPHIYVVDZtVKKS 



GKVCLVAHWVCL3 IL3 PPPAGMXTPNAQBABGOQTRAAAGRATG 
S^TflTKKXVSQKXQRGRPSSOPCRNXVGCBISHGWKEGDKPITO 
HKGTVIiDQVPIMPSbYLVKYIXJIDCVYGLBLHHDBRVLSLKILS 
DKVAS3HXSDANLAOTI IGKAVEHMPEGEHGSKDBHRGMVLAQA 
PIMKANFYITYEKDPVLyMYQLLDDYKBGDLRIMPESSBSPPTB 
REPGGWXX3LIGXHVBYTKEDGSKRIGMVTHQVEAKPSVYFIKP 
DDDPHXYVYDLVKKS 



BVSPPraCCLTVAVADPGVSEGPRGPGAGCBMPGRGRCPDCGST 
BLVfiDSHySQSQLVCSDCGCVVTBGVIiTTTFSDBGNIjRBVTYSR 
GIGBBBQVSRSOQRGLRRVBOIiCStVLOLP PTPBDTAVAYYQQAY 
RifSGlBAARLQKKKVXjVGCCVLITCHQRBWPLTMGAICTLLYAD 
IiDVPSSTYKQIVKIJXSIJDVPSTjiCLAKLVXTYCSSPKI^PQASPSV 
PAIQfVHDKHKMLSRTMQLVELAHBTHLVTCTHPX»P , VTT!AATFLA. 
M2SI^PADSI*SCSLAHFCKLAHVDLPY PASSHbQBLLAVLLHMA. 
EQLAH1 iRVI^LDKRSWKHIGDIJLGHRQSI>VRSAPWX 3TABVKT 
RBKBPPGWGQGQGEG3VGHNSLGLPQGKRPAS PALLLPPCMLK3 
PKRlCPVPPVSTVTGDSNISDSBXBQYIiRTPQBVRDPQaAOAAH 
QAATSVPNPP 



RSHASRLPELPSGCLVLQVQKLVQtiSGMKATVTiPIH^iKPHGA 
AfiSVVRRIGTNLPlJCPGARASFBTLPNXSOCjCUlDVppVPTLAD 
IAJnAADEBBTYARVRSDTRPLRHTPilCPS PLIVMQHHASVPHLR 

GSBBRLLAUCKPAIiPJUjSSTTBIjQDHLSHLRSQXAKIVAAnAAS 
ASLTPDFL3PGSSHV3SPLPCFGSSPHSTTS PVISDXTBBTBVK 
VPBLPSVPLLCSASPECCRPHHXAACSSSBEDpCVSLSKASSFA 
DMMGIIiKDFHRMKQSQPIjNRSLLKHKDPAVIj ISEVLRRXFALKE 
KDISRKGS 



hrarkrpqrwpshrrkvrvppphlqrgrsglbpgtprhmaaarp 

SZiGRVLPGSSVLFIiCDHQBKFBBHIAYPPQIVSVT^ARMLKNTTI* 

DLLDW3LQVHVVVDACSSRSQVDRLV7\IjARMRQSGAPI>STSEGL 

ILQLVGnAVHPQFKEIQXLIKKPAPDSGLUGLFQGQHSLLH 



FVWNPRXjGRKHRROAAVTQAATHASGTPSPKDGTMTQGKLSVAN 
KAPGTEGQQQVHGBKKBAPAVPSAPPSYKKATSGBtaiKAGAFPP 
APTAVPLHPSMAYVDPSSSSSYDirGPPTGDHBLFTTPSWDDQKV 
HRVFVRKVYTIU^QLLVYLAVVAJUPTPtZDPVKDYVQANPGWW 
AS YAVFFATYLTLACCSG PRRHFPWNI*ILLTV FTLSHA YLTGML 
SSiYNTTSVUjCIjGIXAIjVCLSVTVFSFQTKFDFTSCQGVLFVX^

IifniiFPSGIjIIjAIXjLPFQYVPWMAVXWVIiQftGVFTIJTiALirrQ 

mCTRRH3I«SPKKYlFGftIJfnfIJ) HYXPTF PLQLFGIMRB 






GLVINPVQ IAn*LAL9rPVSKQLYRai/fCRIAYSLHS QLVMLLBKW 
SCTBCTLFTI)QftTVBaH3KBHXVIIXAIHHFRXDFXiOGWTHCBRF 
OVLCSSHVLAKKBXiLYVPLJ GfifTWTFlJSXVFCSJtKSfEEDRDTVV 
BGLRRIiSiyYPKYK^PliLYCKGTRFTBTKHKVSMSVAAAXGIiPVL 
KYHLL PR T KGP TTAVKCLBGTVAAVYDVTLHraGMKHPSLLGIL 
YGKKYEADMCVRRPPItRDlPLDEKBAAQ^LHKLYQBXXALQBIY 
NQKraiFPGEQFKPARHPWTLLOTLSWATILLSPLFSFVLGVFAS 
GS PLLTLTPlGPVGAGNGHCa 



GEEQPLAAAPTBCLBQVXGGAGDPGTNAS FPSPLPGPAPIiKEGK 
THATNFSD IVKQGYVKMKSRKIiGXYRRCHLVFHKSSS KG PQRLB 
KYFD2R9VCUtGCPXVTBXSMVKCVT3tLPSBTKDQAVAX 1 5TH5D 
t SuRHTFTODSBLBAKHWYKTLSVECICSRLHDISIiGHPDliLAPGV 
QCEQTCTFNVFIJiPCPjniDVYGBCKbQlTHBiriywroiHNPRVK 
LVSWPLCSLRRYGRDATRFTFBAGRMCTtAGTOLYTFQTQEGEQI 
YQSVHSATIAlABQEKBVLIBMBKKVRLIfiKGTBffifSyPCTPTT 
MLPILSAYWHHITGSQNIABASSYAGBGYGftAQASSBTDIiLHRPI 
LLEPXP8QQDS5BARTPSQ 



FGKKSTWFPLSRSIiRVASGRSCKIjGHQGYTGSGPGPGKPRJDSGA 
EVPSGSGRATGCSRGGVRQRRQGRAPGSaiWRKBPHMVCTRKTK 
TLVSTCVILSGW1WI I O.LYVGWVTHYIASVYVRGQBPAPDKICL 
KKDXGDTLKI I ERLDHLKNVI KQHIQBAPAKPEKAEABPPTD3 S 
LPAHWGQKLS PEGRRVALKQPQYYGYKAYLSDRLPLDRP 



FVSPSPGFSIAQLNLIHOLTDTKQIiVHSPAEGCDOGSAYANRTA 
LFPDLLAQGHASLRLQRVRVADBGS PTCFVS IHDFGSAAVSIiQV 
TVAPYSKPSKTLBPNKDIJlPGDTVTiTCSSTGGTPSAEVFWQDGO 
GVPLTtaiVTTSQMANEQGLFDVKS IUWVLGAIHJTYSCLVRNPV 
JjOQDAHSSVTITPORS PTGAVEVQVT?EDPVV7^VGTDATLRCS P 
8PBPGP5LAQU1LIWQLTDTKQLVHSFABGQDQGSA2ANRTALP 
roXJAQGMRSI^LQRVHVADBGSPTCPVSIRDPGSAR.V8LQVAA 
PYS KPSHTLKPNKDLRIHSimrriTCSSYQGYPEAEVFWQDGQGV 
PLTGMVTTSQMRHBCGLFDVHSILHVVLGANGTySCLVWrpVIiQ 
QDAH3S VTTTPQRSPTGAVBVQfVPEDPWALVGTDATIiRCSFS P 
RPGPSIAQXjNLXWQLTtyrRGLVHSFTEGH 



VGIPGPTISSAAEMKDL1*DLDBELRYSLAT3HAXMGRRA0QSSA 
QAKH}njKKNSSLTT>TGHTSSAKLPRCSQGGHAGDSVK7WSKPRR 
BCASEBI KDPRLRPQSLNOSDYGGDI PXI PDLKBVQBBDFVLQVA 
APPSIQIKRVMTYRDtJ2HDIitKYSAIQTUX3HIIlLKIiLTKVLAP 
BHBVHEBNPSKQDDVGNDVDHLPTBVSSBVl/rBKDPIiQTEKEDP 
AGQARBT 



GRVOSRRRGAAVPGGCGAGSTQLKV 
VHSOGEAGPIS KDRKERVHQGMVRAATVGYGJXREGGSAVnAVKG 
AVVM*BDDPBPNAGCGSVIJmJGBVKMDtASIMDGKDLfiAGAVBA 
VQCTAHPI ECIARLVMERTPHCFLTrQGAAQFAAAMGVPBXPGBK 
LVTBHHKXRI£BCBKH2KGAQKTD0QKNbGTVGAVAU)CKGWVAY 
AT3TQOIVHKMVGRVGDSPCIiGAQGYADNDIGAVSTTGKjE3Hj 
KVNLARLTLFHXBQGKXVBBAADLSIXnMKSRVKGLGGIirVVS K 
TQDWVAKWTSTSMPWAAAKDGKLHFGIDPDDTTITDLP 



PRRLRYLGQQftTAAARDLRTRGIiQGYCPSATARQQVLVSALGQL 

KGRRSBHRHKBQBMPYSTNKELII/ 3 IMVGTAGISIJ*UjWYHKVR 

KPGXAHKLPBPIiSti(9ITF1SSITbQDBIQ]>DQG7rVIFQHRQIiQI 

l^KLNELLTHMEBLKBBIRFLiCEAIPKLBBYICa)KLGGKrrVHK 

ISPQHRARKRRLPTIQSSATSWSSBKAKSBGGYrrAHTDTEEQS 



FPVPKAFHTRVEEliHLPVUiQKVDHltBMSBSGKSKSF 
EXFRDBIBFHS9RFARAYGDWYKLS I N fUXRKUYAHlGtCTLSBRA 
IMRAPWSGHCHIJrrAVW^YVSBFEGLQHKDrXGHI^KKHLDIA 
IKLLPBKPFLyYUOGRyCYTVSKLSWXBXKMAATLFGKXFSSTV 
QHAI£HPLKAEEIX3XTYEaiFlJYMYIAKCYn>LEBHQNALKFCtJL» 
AlililiPTVT K BDKgAQKEKOiDJCTSLKR 



GBVHOOSSSUiAmnfVRGaABDyKHWQHQCABGHD 
ECAQGHHLGASRYRGAIXJPLRVSRGIcnnCTIiHCAFIJaYQQAGTP 
LTKDHNGFQQBGFGWIDMTIHEGKRWSAACArLHPALSRTlTLBA 
BABTLVBKVLPBC5TRAVGVEYVK1IC3QSHEAYASKBVILSGGAIH 

spqllklsgigidvddlkxlgipvvchlpgvgqnlqijhibiyiqq 

ACTRPITLHSAQKPLRKVCIGIiBWLWKFTQBGATAHLBTGGFIR 
SQPGVPHPDIQPHFIJ^OfVXDHGRVPTQQKAXaVHVGPKSGTSV 
GVOiKIfifiANPQnHPVIQPNYLSTBTDIEDFRbCVKLTRBIFAQB 
ALAPFRGKKIjQPG3HIQSIJKBlnAFVRAKM3SA3fHPSCTCKMGQ 
P BDPTAVVDPQrreVLG VEHLRVVIIAS IMPSMVSGNTjNAPTIMIA 
BKAADIIKGQPAWCDKDVPVYKPRTIiATQR 



PRTaVHPRARSFPLPPPRAPHHRJlHRbliRAVPGPSR 
RHKCRRRAPPPPSTPMGHAGSHRSKAPSLPPRCPCDPWOSSKTKN 
LCSKCFADFQKXQPDDDSAPSTSNSQSDLFSBBTraraaDtTSIT 
TPTLSPSQQPbPTRI'RVTS P3KKBGGPCTDXAHVSUTPT3CRSC 
GTDSQSBHBASPVKRPRLLEHTBRSBBTSRSRQKSHHRCFQOOrr 
KIiBLVQQBLGSCROGYVFCMLHRLPBQHDCTFDHMGHGHBEAIM 

kmvkldrxvgrscqrigkgcs 



KRKKAKPPSPVttCVAAAAPTATVSBKEPP0KLQLSGRDPPGSL3 

AXKVRrSEKECAPHRVNGEGGSGGNSRQLQPPAAPSPQSYOSPAS 

wseaplsaapspsssiwsfsfsagtavpssasaslsqpgprkll 

VPPTLLHAQPHHLLLPAAAAAASANAKSRRPXBXREKfiRRRHGL 
GGARHAGGASRBBNOBVKPIjPRDKIKDKXKBRDTCBKKRBKKKHK 
VMHBIKKgBGBVKHJjKSGKHKPKXWlEDUOlKXV inntKKKXHK 
BHKKEKRPKMTSKSIQTICSGLiLTDVBDQAAKGILNDNIKDYVD 
KNLDTKHYDSKI PKfiSBFFFVSWCEPRVQfWLKRbDTLBFKQLI 
HZBSQPNGGASV1RCXQ 



PVHCVAAAAPTATV8BKBPFGKLQL93RDPPGSLS 
AKKVSTKBKKAPKHVNGBGGSGGHSRQLQPPAAPSPOSTOSPAS 
HSFAPLSAAP3PSS6RSSPSPSAGTAVPSSASA3LSQPGPRJCL1* 



GGAHBAGGAS HKgS G gV KPLPRDKIKPKI gSRD KK KB REK KKHK 






ENKKRKRPKMYSKS IQTICSGLLTDVBDQAAKGIUSlHflKDYVt? 
gHXjyrKHYDSXIPHHSEFPFVBI^PKV^njKKlOTI^FRaLX 
HIEHQPHGGASVXHCLO 



KLFKKATDT3QGKLVHPKAliPbXVGAQblHADKI/jKKV2DSTMP 

IR3TVKSTRBTPPKSKIABGBBKKPBPDISSKBSVSTVSBQKNB 






vkxtxpsratlsasquuzaokgtpmassprpxkdailtraikac 

fqksgasvvaxrxyixhkypslklsrhgyijjcqalkrblnrgvi 

KOTKGKGASGSFVWQKBIUaTQKSRNHXNBSSAVDPBPQVKLB 
CVLPLAFTIUjCBPKEASYSblRiQfVSQYYWCLRVDIRPQliIiXNA 
&QlUWERBQLEQlTGH3ASGTFQliKKSOTKPI»I/jGSUtEYAILS 
AIAAWNBPKTeSTTAIJCKYVLBHH P lal'MSM Y O M HLbK gTLQKCg 
KHGWMBQISGKGFSGTFOLCFPYYPSPGVLFPKKKPbDSRDEDE 
HBDESSBKDSFDBHPPPKRRL0KKTPAKSPGKAASVKCEOSKPA 
PKVSAAQRGKARPLPKXAPPKAKTPAKKTRPS STVTKXPSGGSS 
KKPATSARKB 







RKEHRPVSPKRYSYGTSSSSKRTSGSCRRRRQSSSSAKSQQGSP 

PTKRQRRSKGRPSGGAHRRRKGP0PHPSSSQ3PPDLPLKRK 



PIALAMASNrtroXVKQGYVKI 
ESSKLGIPRHCHLVFKKASSKGP&RLBKPPDHKTtfiYFBHFHKVT 
ELHNXKHITRIiPHHTKKHAVAI IPHDCTSKTF7VCSSKLSAEHKC 
KHLCMECLGTRLM)ISIiGBPDLIAAGVQREQNBRFNVYLMPTPIf 
LDIYGECmQITHENIYLWDIHN7VKVKr.VMWPLSSLRRYGRDST 
WPTFBSGRW3yiX3BGLFT¥XyraBGHMIYQgVH5ATIJ\TABQHHR 
LMIiSMKQKARIiQTSLTEPMTIiS KS I SLPRSAYWHHITRQNSVGB 
IYSLQGMHBMRHSDLTGKSCKTSEHRPLEEHAPLVMYG1THHLF 
MDTSTCKWHDLE 



ASCQ1KRRTAAI*0SGHRLAGRRSPIA1AHASNFNDIVKQGYVKI 
RSRKLGTFRRCTtLVFKXASSKGPRRLKKFPDBKAAYFHHFHKVT 
KMNIKNITRIiPRBTB*HAVAII PHDBXSKTFRCBSBLKABEirc 
KHLCMBCLGTRLND IS LGBP DLLAAGVQRK^KRFNVYI>1PTPH 
LDrYGBCTMQITHBHIYLHDlHNAKVKLVMWPLSSLRRYGRDST 
WFTPK8(aWa>TCBGI*FTFQTREGEMIYQKVHSJVriAIABQHEB 
U4LEMB0KAHLQTSLTBPWLSK6I6LPRSAYWHHITRQKSVGB 
I YSLQQQIKRRHSDLTG JCSCKTS KNRFLBKHAPLVMYO XTHHLP 
MDT S TC KW HDIiB 



GBSPPSBBBSIBFNPSSSGRSARTVSSNSfCSDOTGWPSSQSVS 
PVKTPSnAGNSPXGPCPGSDBGPTRKKCTIGMVGBGSIQSSRYK 
krsksglvkpgskadfssssstgs isapkvhmstagskrssssr 
krgphgrshqasshkpgsspssprkkdllshlcrnqlspvnihp 

SYAPSSPSSSNSGSYBGSDCSPIMRRSGRYNSCGKNHaVRPPNP 

eqyltplqqkkvtvhhi.ktklkbsbrrlhbrksbivhlksqlar 

NRHDHIEESCHRVBAQIAXjKEARKBIKQIJCQVIBTMRSSLADKD 
BgiQKYPVDIBXQWKKLBSLIiQSMKMAHSGSLRDKLCLDPPCDS 
PEKSLimiPPU7IMADGLSZjBBQVT6BGAl]!RBIaLVCTS IANSTD 
LFPSIVTATrrKSGIJLBLVHSTPQANVLSLLPIVMGQEEGSVW 

kravqtdwpyspaxsbkiqsvloklqdpcpsslasrdksepds 

MBS PPKSLSALVVDLTPBNRiSAILLS PVBTPTMJVnAEVHANR 
IWRKUJPMlCVKSRIjDGVXPLJUlGGVVROfYHSSSPLVDLLAVftA 
PWPlVT^AySaX?g^GTDPVYNTQAlxIJg(yrVVATJT??T,y?RrAFp 

ucr 



SGRAGBPBBHEGKQIXDSKE1MIPFNSEDSQQI*BEAYSSGKGCN 
GRWPTDGQRYDVHI/XRMRYAVYWDKIjASKVRRC1VfFYX£3DKD 
NKYVP YSRSFSQVIiBBTYMIAVTLDHWXKXLBS PHRBII ILHHP 
KI/4VHYQPVaGSDDWGSTPMEQGHPRTVKRGVEHISVD3HOGKP 
LQIDHLVPWHGIGPACDLRFHSIVQCVNDFIlSVSLNIjIiQTHPK 
KAQKWQQIGRVKPU?VNWHSPIBSTOVDVDLQRITLPSIJfHLRH 
PraiXrililVPPYNBPTYCOlTVDTVASBMNRI YTIaFLQRNPDFX 
GGVBIACHSMSLir^IiTKQKDSLGDIDSRKBSUIXVMDQGD 
TPTLBgPLKKLQLSHFFDI PKKEKVDKKJVLBLCTDRDLQBIGIP 
LGPRKKILNYFSTRKNSMGIKRPAPQPASC5AH1PKKSBFCSSSN 

trngdyldvgigqvsvkyphliykpkippapgspigmfltvrgl 
kridphtyrpptckdfpniyhppdpvayrxbpmvvpgvbpbpmli 
roBKGPKSHHLBLhBQLTBHSMDLKliNliLGSLRnANKSFTRAPY 
PALQAS BrPBBTBABPKSTS EKPSDVNTBKTSVAVKEKVLP INV 
GHLNGGQRIDYVLQEXPIESFHBYIJPALQSHLCYYfBSEDTVIjLV 
LKKIYQTQGIFLDQPDQ 



GWVPl<LSRXI>PSDflCmYKlOSXNIBI«DYTUDP*M 3 MKGQRGDLS 






EDIYSATLSTKSISPTRAQTOHLPaRDKTBRVCMFLADPYLVNO 
SEQ ID
NO:


Predicted
beginning
nucleotide
location
corresponding
to first
amino acid
residue of
amino acid
sequence


Predicted end
nucleotide
location
corresponding
to first
amino acid
residue of
amino acid
sequence


Amino acid segment containing signal peptide
(A=Alanine, C=Cysteine, D=Aspartic Acid, E=
Glutamic Acid, F=Phenylalanine, G=Glycine,
H=Histidine, I=Isoleucine, K=Lysine,
L=Leucine, M=Methionine, N=Asparagine,
P=Proline, Q=Glutamine, R=Arginine,
S=Serine, T=Threonine, V=Valine,
W=Tryptophan, Y=Tyrosine, X=Unknown, *=Stop
Codon, /=possible nucleotide deletion,
\=possible nucleotide insertion)
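
The symbol key in the column header above can be read as a simple lookup table. The following is a minimal illustrative sketch, not part of the original listing: the names `KEY`, `FLAGGED`, and `summarize_segment` are hypothetical, and treating any symbol outside the key (including lowercase letters) as unrecognized is an assumption made here for illustration.

```python
# Hypothetical helper (not from the source document): applies the
# one-letter symbol key from the table header to a segment string.
KEY = {
    "A": "Alanine", "C": "Cysteine", "D": "Aspartic Acid",
    "E": "Glutamic Acid", "F": "Phenylalanine", "G": "Glycine",
    "H": "Histidine", "I": "Isoleucine", "K": "Lysine",
    "L": "Leucine", "M": "Methionine", "N": "Asparagine",
    "P": "Proline", "Q": "Glutamine", "R": "Arginine",
    "S": "Serine", "T": "Threonine", "V": "Valine",
    "W": "Tryptophan", "Y": "Tyrosine",
    "X": "Unknown", "*": "Stop Codon",
    "/": "possible nucleotide deletion",
    "\\": "possible nucleotide insertion",
}

# Symbols that mark uncertainty or termination rather than a known residue.
FLAGGED = {"X", "*", "/", "\\"}


def summarize_segment(segment):
    """Count known residues and list flagged or unrecognized symbols.

    Positions are 1-based and refer to the segment after all
    whitespace (spaces, line breaks) has been removed.
    """
    cleaned = "".join(segment.split())
    summary = {"residues": 0, "flags": [], "unrecognized": []}
    for pos, sym in enumerate(cleaned, start=1):
        if sym in FLAGGED:
            summary["flags"].append((pos, KEY[sym]))
        elif sym in KEY:
            summary["residues"] += 1
        else:
            # Assumption: anything outside the key is reported, not guessed.
            summary["unrecognized"].append((pos, sym))
    return summary
```

A call such as `summarize_segment("MKV X*")` would report three known residues plus an Unknown at position 4 and a Stop Codon at position 5. Symbols outside the key are reported separately instead of being silently mapped to a residue.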








ITDSTGTHLVLTVTHKNAHSPGLSRGSPQQPSSQPGSPAPAPSA 

QMDLEHPtiQPLPGTPTSLliKKEPPGTESAHSQQPKQQQrcSSSQ 

QKDXII*FDILIQSG51SADFXKPPSLPGKKKPSPXTVCHSPLAAQ 

PSPSAKLPQAAPPPPGSPSLPC31LRDFLSSSTGLPLLTSGHDGP 

BI'LSLIDDLHSQWLSSTAlLDSPPSPMiyrSKLHPVPBPSSTHGL 

DIADGHLDSJffiWLBLSSGGPVLSLAPLSTTAPSLPSTDFLDGHD 

LQLHWDSCL 


6533 


1798 


373 


SnSWIARVKPPRHSSGVQAARIjRPPGGSRPLRARACVLAlAVL 
ALLBRNNADSMSAHSMLCBRIAIAKHLrKRAKSLSRSaiDGGIEG 
GAKLCS ECLKAKLKFLQKVKAGXVAIXKSHLQS’TNLTHLKAIVES 
AKNLBEWSVIJHWFGYTDTU3BKOT1»VVDVWNG<3HTWVKAIGR 
KABALHNINLGRGQYGDKSIXEQAHDPLOASHQOPVQySNPHII 
FAFXNSVSSPMAKKLKEHGISVRGDTVAVNAIiLDHPEKLQPSBS 

bsdsbgpellqvtrvdhehilasvafptbikvdvckrvnlditt 

LITVVSALSYGGCHPIPKBKVLTBOAEQBRKBQVLPQIjBAPMia) 
KBLFACBSAVKDFOSXLDTLGGFGBRSRATVLXKRXNVVFDQPS 
ERAIiEtLVASSEdBSRSLTI FGTGDTLKAITWTANSGPVRAANNQ 
GVKFSVPIHQPRALTESKEALATPLPKDYTTDSBH 




47 


596 

I 

1 


KKELQCFPCEAHLLELIPLLGXHFVLRTAHAQSVIXlPDIHrrVS 

HGA3IJSLRCNY8YGATPYLPWMBRTVEKAFILLVCLKPWRVA33 

LSKKBKBDSSFQIiLLGSRyNVIjKAHCLLPLIRSiLTSCTJSIiLSAQ 

PHCPQGL 


6535 


250 


964 


luctffrdvaiqrdllpkbknlbtlltlaplkidkas^sharlS 

ADATLLTSGTTATVALLRDG I KL VVAS VGDSRAILCRKGBCPMKL 
TIDHTPBRKDBKKRI KKOGG PVAHNSLGQPHVNGRLAMTRS IGD 
LDLKTSGVXABPETXRIKLHHADDSPIiVLTTDGINFMVNSQBIH 
DFVNQCHDWJEAAHAVTEQAIQYGTBDN3TAWVPPGANGKYKN 
SBINFSFSRSPASSGRKA 


6536 


242 


1174 


5LVKE»tTNQYGII*FKQEUAHDnAIWSVAJ#GTNKKENS BTWTCS 
U2DLVKVtnaoU3HRLDLQWSLEGHQLGVV3VDISHTLPIAASSS 
IJlAHIRLSroi^nSKQrKSinAGPVtXMrriAPSPDSQYLATGTHV 
GKVNIPGVESGKKKYSUJTRGKFXLSIAYSPDGKYLASGAIDGI 
miPDIATGKLLHTLHGHAMPIRSLTPSPDSQLLVTASDDGYIK 
IY1WQHANLAGTLSGHASWV1 iHVAPCPDI)THPVSS 3SDK3VKVW 
DVGTHTC^ITFFDHQDQVWGVKYNGNGSKrVSVGDDQBIHXYIX: 
PI 




1638 


921 

I 


tfRFNPPPTQGPDPSLVYRPDVDPKVAKDKAS PRNYTSGPiLDRV 
PTTYKUfflTHQTVDFVRSKHAQPGXSFSYKKMTVMEAVDLIJQGLV 
DSSDmrDFPNSPHAFQTAKGIRKAHPDKDHFHLVGLLHDLGKV 

iai^gspqwavvgdtppvgcrpqasvvfcdstfqdnpdlqdphy 

STEI/3MYQPHC1»LDRVI2<SWGHIX5SARGGQWGGGGRWGTVGGGG 

ABAVPAGDTL3PQSTCTR 


6536 


3345 


2412 


PYLYDFLDALITCQTAPKKAFIKLDGIAfflCLTEQIiRRLTKQVQB 

ARHHRDDKAIKKAVNBYDBTMKKYlPVLMAOAKIYMNLKNyPMV 

BOPRKSVBFCJinHDVHBDCAIVAHVLPMQBDHKYKBAIGFIBPIW 

KHYISWLNVSArVLANLCVSYXIfrSQNSKAKKLMRKIKKKKBQI* 









SYDDPNRKMYHLCIVNLVIGTLYCAKGMYKFGISRVIKSLEPYN 

KKLGTDTtfYYAKRCFLSIiLSNMSXHMIVIHDSVIOBCVDPLGHC 

RLYGTNIPAVTBQPLBKKRMHVGKNTVTDSSRQIiKALIYEIIGW 

UK 




218 


339 


■ 0 * f*J: i n *t y :i y »:#} * :*;Vja *; ; e » v 


m 


3 


391 


t.VCWT.MT.T.T.T.’gP P RH AM ARP PTT UTKAVTTHPDRlJHAWllKif V YLDB 

kqhahi^ltieikdrlqlrvijlrrkdvvlgrpmtptqigpsllp 

IMBQLYPDQRYRSSDSSPWRLVYHIKIDGVBDMUtELLPDD 



5 




























ESSSLQGMDMASLPPRKRPSaHDGPGTSHHHEMKAPGGPSBD^ 

KG R GGPG PAQRVPgSGRSSSLDGEHHDGnTHRPgPFGGPPGSGTP 

SRCGRSGSNWGSGSHMHSGPPHRGASfiGGGRGR 



KROFLS KKTAEASRWHEKWPAL YQgTVLKYFEGKQSCRPAC^fSrfiL 
EGCSCKRTPAPpRAGAGQGGVEnALDKQYYFTVUFGHEGQKPLB 
IiRCBKKQDGKE>e<KAIHQASYADILIEBKVL>5QKYIHLVQrVKT 
BKIAANQLRHQLEDQDTKIKRLKSRI ialhktkkrmrpyosnqe 
DEDPDIKKI KXVQSPMRGWLCRRJWKTJVQDYICSPHAESMRKR 
NQrVPT?4VEAESBYVHQLYXLVHGFLRPLHMAASSKKPPISHDD 
VSSIPI^BTtMFLHBXPHQQLKARIAHHPTI*IIiADLPDirJiPM 
I^IYQEPVRMHQYSLQVIANCK®nU>FDKIiIJOQyEANPACEGiai 
LETFIjTyPKFQIPRYirrtiHEliLAinPHEHVBIUtSLEFAKSKLB 
KLSR VKHDEVSDTHHIRKNIAIBRMIVKGCD I LLDTSQTPTRQG 
SLXQVPSVBHQKLSKVRLBSLSLKBBBHHOCFLPnCHyLXClHS 
SGGK3jHLX^KTGGVI>SLXDCTI*IEBPnASDDDSKGSGQVFGHIJ)P 
KI VVBJPPDHAAPTVVLLAPSRQBKAAHHSDI SQCVDNXRCNQLiM 
TIVFEBNSKVTVPHMIKSDARLHKDDTDICPSKTLNSCKVPQIR 
Y7«VKRIJ^RLTDI^FI*SIDELHTFLHTYRIFTTAAVVIiSKLSD 
XYKRPPTS I PVRSI.gr»PFM9^HRggHLVDG KS PRLCRKPS5 P 
PPIAVSRTSSPVRARKLSLTSPI^SKIGALDLTTSSSPTTTTOS 
PAASPPPHTOQIPIJJLSRQLSSPBQSPGTVBKNVnNPRVDLCHK 
LKRS IQKAVI*ESAPADRi\GVBSSPAADTTKLS PCRSPSTPRHLR 
yrqpgggttadnahcsvs pasapaiatraagkgs P PGFNNTKRTC 
DKRP I IRIHIATKRVT^NVLRHWVSKHAQDFELNNKLKMNVLiNXaLK 
KVUUJPDLLPQBRKAAANILMALSQDDQIJOIHLKIiSDI IQHTDC 
MKABCPBSLSAMRLAEQITLU3HVIFRaiPYKEFLGQGWMKI^K 
NBRTPYIMKTSQHPlffiMSHIjVASQIMHYAnVSSaANZUSrWVAV 
ADICRCLHNYHGVIjBITSAIJOTSArYRLKKTIfAKVSKQTKALMD 
KZjQKTVSSBGRFKNLRBTIiKNt^IPPAVP'YDGMYLTDIAPISBGT 
PNPTBEGLVHPSKMWaSHI IRKXRQPQQTSYRIDHQPKVAQYL 
LDKDLI IDBDTLYBLSLKI BPRLPA 



PVSGCX5RAGIGLSWA>!AARARVSRWYFGGLA60GAACCrnjPI»DI* 
UCVHLOTQQBVK1iRMXGMAIjRVVRTDGILALYSGI>SA3IiCSOMT 
YSLTRFAXYBTVRDRVAKOSQGPLP PHEKVLLGSVSGLRGGPVG 
TT?ADLVNVRMQM)VKLPQGQRRNYAHAliDGI»YRV7lRKBGZiRRX»P 
SGA3MASSRGAJjVTVGQX*3 CTIX1AK0LVLSTGYLSDOTFTHFVA 
SPIAGGOmT/^PUTVIiKTRmJSKGKYQGVFHCAVKTAKLGP 
LAFYTCC^VPAGI RlilPlHVLTFVP’LBOLRKNPGrKVPS 



PSPCPXKSRXJXyOPNHAGLBAMLSQNFSLHQPQSR VRVRRAS X S 
BPSPTDPBPRTIiHPSPAGHFVQQHP BIiBLW SSFRKRPGRHKLQY 
RSHLEPSGNPLPATPTT3APSAPPASSQGPDTAPRPS PPQBEAR 



BGRQKECP 



P PHSHAALLPAAMTPLI>TLILVVI^LPLAUfUJ3CHVaVYKGDN 
CFNPKRCPAMVAYCMTTRTYYTPTHMKVSKSCVPRCPETVYDGY 
SKMASTTSCCQYDLCIJGTGIiATPATLALAPIIjyVXLKGIiL 



HIJjNGXjDBvAAFPVADIiGAIVRKHFCFTjKCLPRVRPFYAVKCNS 
SPGVliKVLAQLGIiGFSOUSKABMBXiVQBIGIPASKTICAKPCKQ 
1AQXKYAAKHGIQLLSPDNEKELAKWJCSBPSAKMVLCXATDDS 
HSLSCLSLK KGVSUCSCRHT JiKSAgKBHVgVVGVSPHIGSGCPP 
PQAYAQSIADARLVFEMGTELGHKMHVLDLGGGFPGTEGAKVRP 
HEIASVINSALDLYPPSGOGVDIFAKLGRYYVTSAPTVAVSIIA 
KKBVLLDQPGRBKEHOSTSKTiVYHLDEGVYGI FNSV^FTOJTCP 























WO 01/53312 



PCT/US00/34263













TP ILQKKPSTKQPLYSSSIjHGPAVDGCDCVAKGI^LPQUiVGDH 
LVFDNMGAYTVC34GS PFV9GTQACHITTCAM5RVAWEALRRQLMAA 
^BIIDVBGVCKPLSCGWBXTOT1»CVGPVCTTASIM 


6547 


1 


541 


LHSKyi!iAPAljCSQPGMHRCCRRR(yCRQPPHAT^PT.T.T.T.PT.VT . 1 . 
PPIAAAAACFNMllUTf 1 Qt'i FARf'T.TITT jRTT 

HDPHRCRSQVLSGCPEBAAAVWBSLQQKABQAPRPHNLHTtjCGA 
PVHVRERGTGS BTKQBTIjRAiaPAItETiAPAPPLIAAALAlAYLL 
RP 1 A 


6548 


2 


219 


PVSRLS VRDVRFPTFLGQfiSADAMHlTlPD Y SAAYVP I BTDABDG 
I KjGOGrTFTLGKCTKVGELKlliSfiPQiJA 




73 


1490 


KTGRVCKDARPACGSaSRRRRKEAAPaiPTPSPSSSSPTSSRPA 

AHAF 8 KAPARLSRPRARKBPPDPGRRYIQBBIIQARKHKLIKKC 

SSVAAKLWFLTDRRIRBDYPQKBILRALKAKCCBBEIJ3FRAVVM 

DBVVLTIHQGNliniHIHEKLITAyPQVVVVRVPTPWVOSDSDrr 

VIiRHIj£KMGCRL#flNRPQAXIjNCVinCFWTPQRLftGHGVPI*PDTPE 

YGGH3NPAKMIDRAHVbBFPMVVKNTOGHRGKAVPIjARDKHHLA 

DI^HI^RlffiAPYIJTOYYVKKSHGRDVnvrVVGGnVV^^ 

DGRMQSNCSLGGV<^CSXiSECGKOI»AIOVSKILGMDVCGIDIiL 

MKDD6SPCVCSAIiAHVQFIAFDEn£39LOVnSX IADYAASLLPSG 

HLTRRHSX 1 LSWSTASBT 8 BFBLOPPASIAVDHMSASSSSVDSD 

PR3TKRSr J LTIOiPG01^7IM»Q^LAigBrKIiLVD 


1 


2293 


922 


LFVSLIQFLI IUSLVI^MVYGHVHVSTESNLQATBRRAEGLYSQ 
LliSLTASQSNljTKKLNPTTRAKDAIMQMWIJlARRDLDRINASFR 
QCQGDRVrYTF^RYMAAIILSBKQCRIXiPKDriHKSCnALLFML 
NQK\nCTI^VBIAKBKTICrK3)KR3VIiI2?KRVABEQI»VECVKTRB 
IXJHQKRQLAKBQliQKVQALCIjpZiDXDXFEHDLHHIjIfROS 1 1 PRS 
U5NLGYNLYHPLGSKrASnWACDHWPSl>tSSKVEBLARSLRAD 
IBRVARBNSDIiQRQKLKAQQGLRASQKAKQKVEKBAQAEBWtLQ 
ABCSRQTOLALBgKAVLRKHRCMmpgLBBKKREABCLRMKLAl 
RKSAUXKrTKTKSQPW*WSRP«OPVPHPQPlDPASLBBPKRKI 
LBSQRPPAGI PVAPSSG 




157 


748 


IQP PDPRNMTIAAY KBKMKKIiPLVS liFCSCFIADPLNKS SYKYB 
MJTVDLHWCVI SDMBVIKLHKCTSGQS PBVX1KPPSFDGVPBPN 
ASI*PaRRDP5LBBIQKKLKAAKSRHffY0BABLI»KHIiAHKRBHBR 
BVXQgAXBgMMHPIKMftKBKLftOKMBSHKBHRRAHIiAAMLEIUjQ 
KKDKHAEEVRKNKELKBEASR 


6552 


157


748 


IQPPDPRNMTi»AA.YTCRKMKKLPLVSI^^5CFLJU)PI2IKSSYKYB 
ADTVDLUWCVISIMEVIBLiraCTSGQSPBVILKPPSFDGVPBFN 
AJSLPRRRDPSLBRIQKKLKAARBRRKYQBAKIjLKHLAEKRKHBR 
BVIQKAIEEMNHP1 KMRKBKLAQKMESHKBHRRAHLAAMLBHiQ 
BSDKHASBVRKNKBLXBBASR 


6553 


2 


1007 


PVWSHMAAHI3YGRVWLNVIiRBAVRRK1iRKFIjDX.CAGSKAIVWI) 
BYLTGPFGLIAQYSIXKEHEVBKMFTLK3GMRLPJJU3VXWI I PFV 
RPRLBLMDI lAEHVLSEDRRGPYBDPHILPVPRRSLbCEQRLKD 
LGVLGSPIHREBYSLDLI PFDGDLLSHB3BQAPKHCYLEGDQTS 

lyhaakglmtlc^yqtxpqifgkgecarqvanmmirmkrbfto 

SONS I Ff^FimiiLLDHNVDIiLTPIiATQLTYBtTLIDHIYGiaNS 
YVKLPPEKFAPKKQCTGGKDLPTBAKKLQIiNSAEBLYABIRDKN 
FHAVGSVI^KKWCI ISAAFKERHNWCTVGKI KQFVSQLPHMQAA 
Rt^LANRTSIABLIKDVTTSRDFFDKLTVKQKFWSGI DTDKVNN 
YIEDCIAQKHSLIKVLRLVCLOSVCNSGIiKQKVLDYYKRBILQT 
YGYEH I LTLHNLRKAOLLRPQTGGIfflNYPTIHKTLRLHMDDVlIB 
QNPTDlSYVYSGYAPLSVRLAQLI^SRPGWRSIBBVLRIIiPGPHF 
BBRjQPLPTGLQKKRQPGSNHVTLIFFLGGVTFAKIAALRFIjJSQL 
SDGGTBYVIATTELMNQrTSWIRAIiMSKPF 





6554 


119 


1244 


FEMGSQVSVBSGALtfVVIVGGGFGGIAAASQLQAiUTVPFHLVDM - 

kdsfhhnvaalrasvbtgfakktfisysvtpkdnfrqglvvgid 

LKNQMVIiQGGKflLPPSHLILATGSTGPPPGKFNHVSSQQAAIQ 
AYKDMVRQVQRSHPIVVVGGGSAGVEMAABliCrKYPKKBVrLIH 
SQVAIJU3KBLLPSVHQKVKKIIXaKEVQLI*LSBRV3NLKKLPLif 
BYHBYlKVOTDKirTBVAOTLVXLCroiKIHSSJVYRKAFBSRIiAS 
SGALRVHKHLQVECfflSHVYAI(3)C!ADVRTPH>lAYIiftGLHAlJXAV 
AH IVH3VKQRPI£AYKPGAi»TFI«I»SMGRNDGVGQXSGFYVGKU1 
VRLTRSRDLFV&i SWKTMRQSPP 




1552 


498 


IHMALLRRIKQVLLFLLrUTliCVIl^fKRVHKGlVPKNBAbDBSB 
TPEKLEBEI PVVICAAAORMGAIMAAINSiySIITDANIZiPyVVG 
LRNTXiTRZRKW I SHS KLRB INFKIVBFHPKGIiKGKIRPDS SRPB 
IAQPLNFVRFYLPLLIHQHBlCVIYI£>DDVrVQaDIQKLYrrrrnA 
I/2HAAAPEDDCl)LFSAQDIHRLVGIiQOTYHGYU>YRKKAI KDIjG 
ISPSTCSFNPGVIVAHMTEHKHQRITKQLEKPfKQKNVKBHLYSS 
5LGGGVATSPMI*IVPHGKySTlNPI*HIiaa^WNPTmRYSKHPL 
QRAKLLHKNGRHKPWDPPSVHND1WSSWPVPDPAGIFKLHHH3 


6556 


241 


1449 


ASIX^CGCFFVlH\^VIlLPSIiOSPPTFGFLU)lDGVLWCUIRVr“ 

PAALKA3PRRIiVHBQGQLRVPVVFNyiHftffiJIbQHSKAQSIi3ALI13 

CEVCADQVTLSBSPHKLFSBYBBSRMLVSGQOFVKBtlAQGI^GFR 

HVVTVDKLimAFPIJJMVDIiKRRLKTTPIjPiaroFPRrEGVbLLO 

BPVBWBTStOXiIMDVlliSMGSPGftGIiAXPPypHLPVljASHMDLI* 

WMAEAKMPRFGHGTFLLCLBTrTOKVTCKEELRYKGU«3KPS ILT 

YOYAEDLIRRQABRRGWAAP 1 RIOjYAVGBOT>MSDVYGANLFHQY 

LQKATIUK3APKLGAGGTRQQQPSAS0SCISILVCTGVYNPRKP0 

ST KPVLGGG BP P FHGIIRD1C FS PCLMEASHWND VNEAVQ LVFR 
KBGWALR 


6557 




1534 


RMOGRTS CHLPRDVLTRACAYQDRRGQQRIiPEWRDPDKY CP S YN 
KSPQSN3PVLLSRLHFBKDADSS5RI IAPMRKGLVPSWPKBSDP 
SKI^PNTlliCELSDTVMBKRSPlCVPLnKGRRCVVLADGPYBJfQRC 
QGTOQRQPYFXYFPQIKTKKSGSIG3lADSPENWBKVWt9JWPI.LT 
MAGIFIX!WEPPEGGDVLYSYTIXTVDSCKGLSDIHHRMPAILDG 
BBAVS KWliDFGKVSTQEALKLIHPTHNZTFHAVSS WMNSRHNT 
PKOjAPVDLVVKKELRASGSSQRHIXJWLArKSPKKSDSKTPQKK 
BSDVPQWSSQFLQK3 PLPTKRGTAGLLBQWtiRRRKKKH pttftKR P 
YSQ 




21 


1138 


PHURRRGGRKMHLGSCLBGGRBAftBBBGBPKVKKRRIJ, jCVSFAS 
VASCDAAVAQCFLABNDWBMERAUI3YFBPPVSBSALERRPBTI 
SBPKTYVDLTNSKTTDSTTSKISPSEDTQQffi3Ga4PSI*ITWNID 
GLM^NLSKRARGVCSYIiALYS PDVIFLQBVIPPYYSYLKKRS S 
HYBIITCHffiEGYFTAIMLKKSRVKLKSQBX IPFPSTKMMRNIiLC 
VHVNWSGNKIiCLMTSHLBSTRGHAABRMHQIJMVIiKKMQBAPBS 
ATSTIgAGPTNliRDRKVTRCGGXiPHNXVDVWB FTiGKPKHCQYTWD 
TQWHHNIXalTAACKLRPDRIPPRRAABEGHIIPRSljDlJ/Tr.KKT. 
D03RFPSDHKGLLCHLDIIL 


6559 


3 


364 


™™bi 


6560 


3 


1435 


TAISGGIWLWUWRCHVfPRPLPQSCVGTBGGLQVRDTSSRlAKG 
GVDHTKMSLHGASCXfflKRSRDRRRSSDESRDSSHBRTBSQLTPC 
IRNVTSPTRQHHVERKKDHSSSRPSSPRPQKASPNGSXSSJUmS 
SBHSSQSSSDGSCKTAGEMVFVYKHAKEGARHTRTSBRVTLIVD 
HTRFVVDPS I PTAQPNTHI/3RMPQSGRBHHFFRPHBKGBYBVAB 
G I GSTVFRAI LDYYKTGI IRCPDGIS IPKLREACDYLCI S FKYS 
TXKC&DI^ALHBBLSNDGARRQFBFYLBENIIiPLMVASAQSGBR 
BCHXWLTDDDWDWDBKYPPQMGKBYSQ1 lYSTKLYRFFKYIE 



Amino acid segment containing signal peptide (A=Alanine, C=Cysteine, D=Aspartic Acid, E=Glutamic Acid, F=Phenylalanine, G=Glycine, H=Histidine, I=Isoleucine, K=Lysine, L=Leucine, M=Methionine, N=Asparagine, P=Proline, Q=Glutamine, R=Arginine, S=Serine, T=Threonine, V=Valine, W=Tryptophan, Y=Tyrosine, X=Unknown, *=Stop Codon, /=possible nucleotide deletion, \=possible nucleotide insertion)








^BRbVAKSVLKKRGLXKIIdjGIEaYPTYKBKVKKRPGGRPHVIYN 
YVQRP FIRMS WBKKEGKSRHVDFQCVKSKSrTNLAAAAADIPQD 
QbWMHPTPQVDBLDILiPlHPPSGSSDLDPDAiQHPMb 


6561 


3 


1086


PCPMAALTDLSFHYWFKNCNLVl2JLSBKyVPlTGCDSGFGNLL 
AXQLVDHGWQVIJUlCFTEEGSQjaXSU^TSYHXiQTTLliDVTKSBS 
IKAAAQf7VSDKVGBQGLR7ULVHHA£7VGI>PSGPNBTft*THDDFVKV 
IHVNI»VGLI3VTLHMLPMVKRASCTVVNMSSSGGHVAVXGGGTC 
VSKPGVKAPaDSIHRgbYYPGVyVCIlgPGMYHTAlIiGKHHIiBS 
HMRKLtfEHl»PQBTRDSYGBlrrPRI YTDKLXN1MQ VAK PHVHDVI 
HSMEHAIVSRS PRIHYHPGLDAKLLY I PLAKLPTPVTDFH»SRY 
IiPRPADSV 


6562 


1 


1562 


MSlTiYDTBAHlCRQLmy^VS^DJTmfATJIQgBTmTPItTJgWTJTOV 

LYBWFLGKRSSGVFVSGPMLXBKAKDFYB^fQLTBPCVFSGGHXi 

WRPKAHHGljaOjDASSBKQSADHQAABQFCAFFRSIiAAHHGbSA 

BQVYHADBTGOjFHRCIiPHPTPBGGAVPGPXQGRDRLTVIjMCAIIA 

tgshrlkpiaigkcsgprafkgiqhlfvaykaqgnahvdkbifs 

DWFHHIPVPSVREHFSTIGLPBDSKAVUiLDSSRAHPCSAKLVS 
SHVFTIFIjPASVASI>VQPM0QGIRRDFMRNPINPPVPLQGPHAR 
YHHHDAI FSVACAWAVPSHVFRRAWHXLWPSVAFAEGSSSHBB 
LKABCFP VKPHWKS FAHU^LVKBGSGCPGQLHQRQAASKGVAO 
n^BrafipppjutTSpainniBgnRgTPgnnQTigprmpftpfimwniwg 








QAAVAFDAVLRPABRQPCFSAQBVGQLPALRAVFRSQQQVRKRR 

GAliGAWKVEALQBGPGGCGATAQSPIiPCSSTAGDN 


1 


1315


2694 


IJUtP AQP VLLRB PBGAGPP W AGHL VHHliQGGliliRERAHP D tiKA 
HKHPLPCIX^MPHPQMCIGHLRMVKANSRPVVWOIGYDHTAWVYTG 
GYGGGCPQGLASSTSNIYTQSEVKCVHIYEHQRWNPVTGTrSRG 
LPTDRYMWSDASGLQBCTKAGTKPPSbQWWTVSDMPVDFSVPGG 
TDQEGKQVASDFPASYHGS KTMKD FVRKRCW ARKCfOiVTSGPWI, 
BVPPIALRDVSIIPBSPGABGSGHSLALMAVSDKODVLCRKJVS 
KLNPAI3SSWLHVGTDQPFASI3IGACYQVHAVAEIDGSAFYRGSV 
YPSQPAGBCTrYHIPSPPRQRIiKQVSAGQTSVYALDBHGNLWYHQ 
GITPSYPCX^SSKKHVSNNVCRVBVGPUXJVlfVLANKVQGSHSLS 
RGTVOncrGVQPHEPHGHGWDYGIGGGWDHISVRANATHAPRSS 

sqkqbpsappbahgpvcc 




1 


575 


g^CSE^Y0^5Ba>mB0wS^SsgroCTfl55Ag5SBr" 


l 






HKVOTCDLLLWHMHPIiPGTQIJjDWtfXnXroiAFRFLNYVQiSQHQR 
KQKRQLRAQQHLS HBBlAKEY^iBBDSLGGS RVVVCDIHKBMtiK 
VGKQKALAQGYRAGLAIfVl/31AKKLPFDDDKFDIYTlAFGIRNV 
THIDQAIjQKAHRVLKPGORFIjCIiBFSQVNNPLISRLYDLYSFQV 
I FVLGBVXAGDWKSYUYL.VKS IRRFPSQEKPKDM I HlAGFHKVT 
YRSLTSGIVAIHSGFKL 




1464 




RSAVANGLTKRRKOIiKINGHYlSLILAVQIAYLVQAVHAAGKCD 
AVFKOPSDCLIiKLGDSMANYPQGXiDDKi'N IK'l'VCT YNBDFHSCT 
VTALTDCQEGAKDMHDKIiRKBSKNIJflQGStFKLiCGJSGNGAAGS 
IiPAFPVLLVSI>SAAIiAIWIiSF 


6566 


3 


1385 


KYBSAQPGGTQPKPGLG3\RMAXKKALVMCXjGLPLFI*FPGAWAQG 
HVPPGCS QGXNPLYYNlJCllRSGAIfG I VLBAVAGAGIVTTFVLTI 
ILVASIJFVQDTKKRSLliGTQVFFIibGTbGbFCLVFACVKKPDF 
STCASRRFLFX3VLFAICFSCLAAHVFALHFLARKNHGPRGWVT f 
TVALLLTLVKVIINTEHLIITIjVRGSGEGGPOGNSSAGWAVASP 
CAIAHHDFVMAblYTMUiIiIcGAFZiGAHPALOGRYKRHRKHGVfV 
LLITATSVAIWVVWrVMYTYGNKQHNSPTHDDPTLiAIAIiAANAW 
APVLFYVT PKVSQVTKSSPEOSYQCTMYPTRGVGYBTILKBQKD 
OSMFVEHKAFSMDBPVAAKRPVS P YSGYNGQIXTSVYQPTBMAL 











PQiKVPSEGAYDIIIiPRATANSQVMGSANSTIiHAHlMYSAQSHOA ~ 
ATPPKDQKH3QVFRSPYVWD 


6567


125 


863 


TKRSNLKAXACS IHHIRTHSYVFVMDSSQTliVPijLOACIDQDPN 
YSKRIJLSSGFDPHIRDSRGRTGISIiAAABGNVDICQLLHXFaAD 
LlAJDYQCayEALaLOQCVDTIQFLVBNtJLKIDICWHQGATPLVL 
AK3RGVNKD7IHL[£SI£BQBVK)QFNR0THSKLBTMQTABS8SA 
HHSHSLmPHLQQGEGVbSSPRTTWQBFVEDIjGPMRVIiUiIFVI 
ALL3LQIAYYVSGVLPFVBNQPBLVH 


6568 


3 


1183 


HASDRliVLPDlS^SHFSQASANIiQG PS RTTKLFHPTLAS ISSPM 
LBGABLYFNVDHGYI£GLVRGCKASXI*TQQllYINLVQCSTLBnL 
KIHLQTlT>YC2JPIiAi3HTTJPX.TVSKIDTEKaKRIXXJBFBYPRHHp 
LRPLSTPLTYMTCSyMIDNVUiIjMNQALQXKSVKKIIiGKQiPLG 
RPTBHBAVHIABTPSDLPHAXLXBTPIjAPFFQDCIGBKALDSW 
XELLHBIKUfKSTI^AFYKPCRHHQDVTABVMCPXLKFEADRRAF 
XlTIiN5PGTBI»3KKriRBTtiYPTFGgl>YPBGliRl»bA^BDFDQMK 
HVADHYGVTXPliPBAVGGSGGKriiRDVFYBRHVQJWVIAFWRQP 
HYGVTYATVKLKBQBIRIilVWIABCISQRHRTiaSSYlPIL 


1 


205 


1532 


p PRO VQfi r PTPTJ JZRJiRTAGPSUHRKgASAPQGljRPVDPR 

BMSWLFPL7K3ASS9AAGS PGGLTSLQQQEQSLIBSI£HSH5SI 

akiqkdvkyrlpptiknltininu*i>ppqfpqkkpvisvtppir 

HHLMDKQ6VYVTSPLVNMP1HHSDDQK11QSLLDBFHKHPPVLA 
PTSTAFP YLYSHPSOM3 PYASQGFPPLPPYPPQBANRSI TS LSV 
AOTV3SSTTSBTTAKEAAPS?GVI»SNI*PLPXPTVDASI PTSQNG 
PGYKMPDVPDAPPBLSSIiSVSQLTDMlIBQBEVIiLEQPLTLPQLK 
QIITDKDDLVKSIKKIARKNIiUjKPSUEAKRQTVLDKYELLTQM 
KBTFBKD4QRQHEL8BSCSASALQARltKVAAHBAEEBSDNXABD 
PLEGKMEIDDFLSSPMEKRTICHCEmAXEBKLQQAIAMHSQFHA 
PD 


1 


130 


1304 


ARLPidjTFLKBGPIiYVIiLSHWVFVGAPRPPASDSHKXGLVPSAP 
PA3RKM03 KALPAP I PIUPSLQI/TNYS FLQAVFTFPATVnHLQG 
LYGLSAVQTKHMHHWI/3YPISVHEITRSTIYBMAAAQGLVDAR? 
PFPALPFTTHLFHPJCQGAIAHVDPAIiHiaMtWlFDPAMLAVAATCt 
BDPPKMGDLSKLSPGLQSPISGLSKDTPDRKPSRGaLPSETBXB 
FICKPOGBkHFXtCSYHIJJHBBTHTDBRPYTCDICHKAFRRQPHL 
RDBSYlBSKBKPPXC0BOOiQSFCQ3BTlAVHRrLBKQfrSSPTAA 
SSAAKCSGBWICGGT 




169 


656 


APDMKRKXIX2KDTI7njXKlfaCHIiPRGFDKEWtX3CViaVLKWI KGL 
SDPTJIGSXiBSKMKTCFBVFDIJJQDGFISKKBMPHMLKKSsLIJCQP 


5 BEDFDEG IKDLVBITLKKMDHDHDGKLS PADYHLAVRBBTLLL 
KAFGPCT.PDPKSQWKPKAOVFKDPNBFNDM 


6572 


49 


izzz 


TPKBflQ Pfiar JT ^ iAAfiUL’VCGGRWWP'RSHKRG YFSSAKMGSKREN 
LfiCBHRHQKLWDlflrf ¥ CKKI^VQALKMVMSQIRKQHVQMBNDHRV 
QRKQFLRIJ>QWBQFBlJ>HBBAXQKAKBMKRItKBIXg»KQBBKLftM 
BLAKlHHBSIjKDBKMRQgVRBHSXBLRKliBKKLKAAYMlTKERAA 
QIAKKDAIKYEQMKRDABIAKTMMBBHKRI i rkenaaedkrmka 
ICAQYYIJDKKD riRBQBKKKQRATEQXJ^HKICLMIDBrVRKIYKED 
QLBKQQKLBKMHAMRRYIHBF0KE0ALMRKXKRSBMEHBMRXI1 

hfanmqqqrbedsmaxvqbhbeeslqlchbu>tqklbbmlrcprbd 

LBQVRQBLYQEBQABIYKBKLKEBABKKLRXQKBMKQDPBBQHA 
laCBDVLQAAKESBBHFRKTMIiAKFAEDDRlBUlHAgRQRMROLB 
HRRAVBKIiIESRRQQFIiADKQRBLEBWQLOORRQGPI NAI IEEE 
RUOtLKBUASNLLSYIiPKGVPKKEDDIDLLGEEFRKVYQQRSBl 
CEKK 


6573 


767 


275 


GGGGGBSQSraAQDGTRTPATDC!2«IiQGPHXI>TrQGGYD«VQK 

lfldffrurlsqrptakbleqrhilkprhbqkbqbbkrbikrrl 



TPifT.gOPPTPBgt.HBiaCIURPSDYVEWllAQPlfDimADKPWTR



LTAAD 



lessvpvsvgvfwaogvswtgaaglqdgalsdtmarhaekamta 

LARFRQAQLHBGKVKKRRPPIASKCTKLPKAKKJfRRQXIGSISX 
KVRQXQHRGLGEPRXRDIMDSlHKIJjR BBjHH BVRXKKLQGPDY 
GKVGPKMLDHEGKEVPOiRGYXTFGAAKDLPGVRKLPRKKPIiP P 
PRKTRAKUCKAIDPBYYG YIjDHDDGV XVPLEQEYBKKLRAEIjVB 






GDDSQQKFIAHVPVPSQQHZBBAZiVRRKKMELbQKTASBTLQAO 

SEEARRLLGY 



S PAbftSQSGGITBBgMLHPQBliqVlDbPDYBllVBDgTFPPETPP 
AS PERQDGKGTKFDEESQK3APVPVPPKRTVKRHIPKLDAQHLI 
SSRGLPALRHVFDKAKPSGKGHEAEEXLEMLlRBHBHHAHRItPPK 
WPBDPIDRVBYMSKKBVQTCWCRXRLDLPILHBDFV3NNDBV 
AS22NBH0VTSTBLDPFLH7LSSSBMFASSLSISLTSEQQQRXBR 
HKQLALKRRQAKLP 



P B PQALVGQKRGALRLLVARLYLTVSAP, 
RETRALADSHFRGLGV0VPGVGQAPGRVAFVSBPGAPSYADFVR 

gpllphlpcvpssaftqgwgsrrrhvtpagrpdpdhllrtygdv 

VVPVAKOgVQ g YNS HPg BaMTLHDYXTyWKHYlQftOYS3PROCI» 

YLKDWHLGRDPPVKDVFTLPVYPSSDHIiHBPWDALDVDDYRFVT 

AGPADSHSPFHADIPESPSNSVMVCOIKKHZiLFPPGOBEALROR 

TCNLPYD\TTSPAI»C3)THLHKfflQI»AGPPtBITQB3IGEWVPVPSG 

WHHQVHWLVMCCPSCPLSGAPLOfiDG S TISPLSQPgLGWMGVAH 

G 



SDRMASDDPDIVXBANLEAPYKKBBJDBQQRKBVKKDYPS1ITT8S 
TSKSCafBTSGSSTIGBTfiHRSRDRDRYRRRNSRSRSPGRQCRHR 
SRSVDRRHGBESRSRDHRRELttfVHYRSPPLATGYRYGHS KSPttF 
REKS PVREPVD27L3PBERDARTVFCMQLAAR1 RPRDLKDFFSA V 
GKVRDVRI IS DRHSRRSKGIATVBFCSIQSVPIAXGIiTGORIxLG 
VPI IVQASOABJOnUiAAMANNLQXCTGGCTIRLYVGSLflFNITED 
HLRGIPKPFGKIDNTVLHKDSDTGRS K GYGP ITFS DSBCARRAL 
BQI2JGPRLAGRPMRVGHVTERLDGGTDITFPDGDQKLDLGSAGG 
RFQLMAKLABGAGIQLPSTAAAAAAAAAftOAAALQLNGAVPLGA 
LifP AALTALS PALNLASQCIjQLSS LFTPQTM 



PSSSATHHRAP1iKRAT11£MAIiTQASDPSABABAHGEKPFLLRA 
LQlALWSLYVfVTSISMVFLIi KYLLDS PSLRLDTPI PVTFTQCL 
VTTLIjCXGLSAIiAACCPGAVDPP3LRIiDijRVARSVI»PLSVVFIG 
MITPNNLCIiKyVGVAFYHVGRSLaTVFHVLLSYIIiLKQTTSFYA 
LLTOGI I IGGFHLGVDQKGABGTIiSWIjGTVFGyiASIjCVSLNAI 
YTTKVLPAVDGS IWRXjTFYNNVNACIIiFLPIiLLLIXJBLQALRDP 
AQbGSAHPWGrWTbGGLPGPAIGYVTGLQIKFTS PWHNVSGTA 
KACAQTVLAVLYYEBTKSPLHRTSIOOfVIjGGSSATfHVRGffBMK. 
ICrPRBPSPKDSEKSAMGV 



RPPRVWY PELRSl«SAAAPRCf SHRTAPGIMVPYFTSSSVNSSAYT 
rYMGKDXYENEDUIKHGWPEDIWFHVDKLSSAHVYLRLHKGENI 
EDIPKICTIMDCAHLVKANSIQGCKKNNVHVVrYTPRSNLKKTADM 

dvgqigphrqkdvkivtvbkkvnbilnrlektkvbrppdlaaek 

KCRDRKERNHKKAQIQEMKKREKKSMKKKRBMDBLRSYSSLMKV 

ENHSSKQDGNDSDKFH 



LVAliKHWKPKGTWIPAPQSPVFGBAvSGVYMMTKVbGMAPVLGP 
RPPQKQVGPLHVKVEKKBB KGKYLPSLEMFRQRFRQPGYHDTPG 
PREALSQLRVLCCRWLRPKIHTKEQHiEU>VIiBQFI>TILPQBLQ 
AWVQgHCPRSAREAV TT .LRT1T .mrer jDgP G K QfV ST PPHSQKPVWB 

kisssgtakbspsshqpqpletshkybswgplyiqbsgeeqbfa 

ODPRKVRDCRLSTQHRBSADEQKGSEAEGLKGDIISVIIANRPB 

ASLBRQCVHLBHBKGTKPPbQgRGSKKGRBS VPTK P TPGER RYl








OVEOaKAFSdSSNLTKHRRTHTGKXPYVCTKOoXAFSHSSIlLTL 
HTRTHLVDRPYDCKCG KAPGOSSDLLKHQBMHTHKAP YQCKDCG 
KAFSGKGSIiXRHYRlHlGEKPYQOiBCGXSPSQPAGLSSBQRLH 
TGKKPYKCXB0SKAPHHSSNPHKH0R1HTGBKPYKCBHCGKTPC 
SKSNL&KHQRVHTCBGKAP 




22b 


476 


RVPtiRDI^STPMASNBTASlAOAHKLVEQIiKMEABIDRlKVSKA 

AADLMAYCEAHAXEDpliLTPVPASENPFRKKKPFCAXL 


6582


1428 


718 


CFTTK3HC3PVSVPYL3 PLVI.B KJ^^LLKMKGDQVlHYSSPltT" 
QHPlIFWTLVWYFRRLDLPSHLPCTJCLTSKHCNKGVQLPLSSLS 
QnSXLVYIQLLWDNINLHQBPRKPLYVSKHKPNSBKKSSLLSBB 
QQBTSTLVBTXRQS IQHHHVZiKPINLLSQQMXPGMKRQRSLYRB 
I LPLSL VS LGRBNI D IKAJFDNKYG I AYNS LS SElLSRIiQKinAP 
P3ASVEWCRXCF3APLI 




487 


41 


RI FSMTSGltLHWRCTHRPA^WSASLRUn^fflPSPRSIsE?“ 
LSMMLSPLPSNTRGLS PTALFRS PDSKHATSCPEtLHLWRCRAPL 
RSPSPLGRU}VLPRS PUIViniQISGKBVLGLQVQRSHSGTGSPAC 
SOAGSGAVQGGNWCIP 


6584 


189 


1750 


pi^haalgpssqnvtbywrvpkbttkkyn ihafnaadkvnfat 

HNQARLERDL6NKKHrQKBEMPB3GAGSBFNRKLRBRARRKXYG 
IVLKKFRPEDQPWJjRVNGKSCHUCPKBIKKQGVTENTSYYIFTQ 
CPIXIAFEAFPVHSWYKFTPIjARHHTLXABEASEBNBHRNKVIIIH 
FSIMOQKWJCDQDGDBDBBEKBHSjGRRKAS SLR IHDLRDD LEMS 
SDASDASGBBGGRVPKAKKXAPLAXGGRKKKKXXBSDDSAFBDS i 
DDGDFBGQBVDYMSDGSSSSQHEPESKAKAPQQKBGPKGVDBQS 
DSSEBSEKBKPPBEDKERBEBKKAPTPQBKKRHKDSSBESDSSB 
SSDIDSKASSAFFMAKKKTPPKRKRKPSGGSSRGNSHPGTPaAB 

qgstsstlraaasklbqgkrvsbmpaakrlrijjtopqslsgkst 

PQPPSGK3TPNSGDVQVTKDAVRRYLTRKPMTTKDLLKKFQTKK , 
TGLSSBQfl^lWXiAQHiKRlJlFERRMINDXKHFSLKB 




1 


1678 


GPIAf7SRIDDPVGGDpRAKASC3 VLHS KPHAMADSSDPASD04Q 

HHKBQRAAQKADVI,7TGAGNPVGDXLNVITVGPRGPI«LVQI>WF 
TDBMAHFDRKRIPKRVVHARGAGAPGYFKVTHDITKYSKAJCVPB 
HDGKKTPIAVHPSTVAGBSGSADTVRDPRGFAVKPYXBKETHDL 
VnNNTPIPPIMJPIt^^PIBSQKRNPQtHLKDPDMVTOFWSLR 
PBSLHQVSFLPBDRGlPDGHRHMHaYGStnFKLVHANGBAVTCK 
PHYHTDQGIKHLSVTEllAARIiSCJSDPDYCItHDLPHAIATGKYPSH 
TFYIQVMTFKOAJBTFPFHPPDLTKVHPHKDYPLIPVGKLVUIHR 
PVNYFABVBOIAFDPSHMPPGIKASPDKMtOGfiLPAYPDTHRHR 
I/^HYLHIPVNCPYRARVANYQRDGPMCMQIISQGGAPNYYPRSP 
GAPBGQPSALBHS IQYSGBV3RPOTANDDWVTQVRAFYVNVLNB 
KQRKRLCENIAGHLKDAQI PIQXKAVKRPTBVHPDYG3HIQALL 
DKYNABXPKNAIHTFVQSGSHIAARKKANL 




32 


804 


PLPBQPA33TSTOPVSGTPAPNKKRXSSKLIMBLTGGGQBSSGL 
NbGKKISVPRDVNL^KiSLliTNSGSKMFKLRQHHVBKFTYBHHP 
DVFSDSSMDHPQKFLPTVGGQLGTAGQGFSYSKSKGHGGSQAGG 
SGSAGQYGSDQQHHbGSGSGAGGTGGPAGQAGHGGAAGTAGVGB 
! TGSGDQAGGBGKH ITVFXTYI S PWERAMGVDPQQKMKLGXDLLA. 
YGAKAELPKYKSFNRTAMPYGGYBKASKRMTFQMPKV 


1 


75 

j . : 


HI? 


RRVPS LGKHP^CWDGBHDI KT P YGIJjHVVTHGS PKGNRPA1LTY ” 
' HDVGLHHKLiCFNTP ptjfedmqbitechpvvchvdapgqqvgasqp 
PQGYOFPSMBQIAAKLPSVVQHFGFK yvIGIGVaftGAYVLAKFA 
LI E^LVBGLVLWroPKGKGWIDMAATKLSGLTSTLPDTVLSH 
I^SQRBLVNNTKLVQSYRQQIGNVVNQAKLQLFWHMYNSRRDLD 
INRPGTVPMAKTLRCPVMLVVCTOlAPAEIXTVyECHSKLDPll IT 
PLKMADSGGLPQVTQPGKLTEAPICYPLQ6MGYHP3ASMTRIAR9 
RTASLT9ASSVDG3R?QACTHSB3SBGLGQV1JH1WBV3C


6588 


137 


501 


JjQLQAQIiIiK5RTNNY QLSDRkRKNGVBLTS LRQKVAYLDKKFS JC 
AOKAI^KSKKAQKVBVLLSKNKHLQAKLHSQBEDFRbQNSTIiNA 
BFSKLCSQKKQIiEQBNQQbKKGAAGJVGVAQACT 


6589 


2 


1405 


RPN^AKATPSFQBPF^LLQ(^t2lPTAQQGl4DQIinilAlWVl CLA - 

CKUJ<ELGLPSYIiKHASTVAGGPPSX.YHPPQIJBMVHVVL1iSLIjC 

YLVLPlJCBHSSHRGVFLSVTILIYIiUCGBNHMVDTVTNHEMnQA 

QMXVAMKAySbGgDLDRGBVGTVPSPVBPHgYLYPVGTrVPGPW 

IS PHSYbQAVQGKPIiSCSHIiQKVAHSZtAlAIiLCliVbSTCVGPYL 

PP YFI PUfOTRLI^NnCRKAfiGTWRSfLaAyBSAVB FHFSSYFV 

GFLSKRIATLAGftSFTgBKnHLBWniiTVSKPmVKbPRSMVEW’ 

TSWRl*PMSY¥bNNYVFKN7\LW/5TPSAVI,VTYAASAI*LHBPSPB 

LAAVLLS LAPITWKHVLRKBLARIIjSACVLS krcppdcsbqhr 

LGlJGVRAXJlLliPGALAXPHLAybGSIiFDVPVMyrTgBQgyaiAY 

TVHJWSBLSHA6HHVTPGCWIFYRLIO 


6590 


2177 


656 


VRAYHHVI^U^JVFTPMFamDBYFRQIJaGAKSPTJmSKLNH 
GSLSLDDFENTQKHGESFGISRIGSJCIIOGTFKSTTMEGAHIiPNY 
aVAEGBDDPIKKGIWMBDDSPVKAV5TPirrPKRLAA»KISIPY 
VDFYKDPSSRRKBKXBRI PVPCIIXVKRNDRfiAVGHKPKIIWSVYR 
HYLBFYVLBSKJjTEFHaAPPnAQLPSKRI 1GPRNYBFLKSRRHB 
FQBYLQKIXQHPKLSNSQbLADPLSPNGGBTQPLDKILPDVHbO 
KIIKSVPGXUOCEKGQaLBPFlIQfPINSCBSPKPKPSRPELTIL 

sptsennkklpndlfhiqianrabnterkqhohyfmbvmtvbgvy 

DYUfTVGRVVPQVPDWLHHIilJWTRlLkPKNTLBMyTDYYLOCKL 

EQLFQBHRI*VS1^TIi1jRI3AJPCENTBPRSIjQDKQKGAK3C?TFKEH 

MNyiPDLLVKCIGBBXKTBSimiPDGIiQQPVbNKQLTYVLLDI 

VIQKLFPBLNKVQKBVTSVTSWW 


6591 


2177 

1 


656 


VAXyBHVLS LLENVFTP WraiRDKYFRQLLRGAES PTRNSKLNR 
GSLSLDE>FrafTQKRGB3PGISRIGSKXKGVFK8 , riMEGAMLPNy 
GVAEGKWJPIEBGI WMEDDSFVBAVgTPNTPRNIAAHKXS IPY 
WFFBDPSSBRXBKKHRIPVFCIDVHttlDHRAVGHBPHHRSVYR 
RYLBPYVLBSKLTBPHGAFPnAQLPSKRIIGPKNYBFIiKSKRKE 
FQBYLQKXXQHFBLSNSQbLADFlASPlIGGBTQFLOKXIiPDVNIiG 
KUKSVPGKLWKBKGQHbBPPXMHPINSCBSPKPKPSaPKLTXli 
SPTSKNNKKLPNDLFKNMAKRABlfTHRKC^NYFMKVMTVEGVY 
DYI^fYVGHVWQVPDWLHHUiMGTRIIjPTCirrLKMYTIJYYLQCKI. 
BQLPQBHRIiVSLXTLUtDAlFCBElTBPRSlXlDECQKGAKiQTFEBM 
HNYI PDLLVKCIGBBTKYESIKLLFIX?LQQPVUnCQliTYVLLDI 
VXQHLFPEZ2IRVQECBVT3VTSRH 


6592 


3 


1861 


APBPIX3ST ISSGSMIDANIJCIiLQBABQRIiKAXVAEFCFAXATKEG 
DLFGVBRFFKIFPLLGLHHBGLRKFSEYLCKQVASKABBNL1MV 
LGTDMSDRRAAVI FAtrrLTLLPKG lARXVBIHQPIVKTyYGPGR 
LYTLIKYLQVBCDRQVKKWDKFIBQRDYKQQFRHVQNNLMRNS 
TTKKIEPRBLDPILTSVTLMNAHSBLYLRPLKKRIfiSDPBVGDS 
HASEBVXQBI^KCLDyi JJ8NC1 »T iSCTMQjEttilGLtVTWKBYFHRB 
TVNKAVALDTYBKGQbTSSMVDXIVFYlVKKCIGRALSSSS IDCI* 
CAM INIATTELKS DBTIDVLCNKIjRMGFPATTFOD I QRGVTSAVN 

zhqssloogkfdtkgibstdsakhsplvtuwvevcsbnistlk 
KT bKSDCTKLFSQGIGGBQAQAKPDSCLSDLAAVSNKFRDI.'LQB 
GLTKLNSTAIKPQVQPWINSFFSVSHNIKBKEPNDYKANDPWVQ 
QPIUIliEQQMABFKASLSPVlYDSLTGLMTSLVAVELBKWLKS 
TFNRLGGLQPDKKLRSLlAYLTiVTlVTIKbKFARLSQMATI LN 
LBRVraiLDYWGPNSGPLTWRLTPAEVRQVLALRJDPRSEDIKR 
LM. 


6593 


3 


1037 


EAFSAGSKRi?IILALURGVIiGGliGGYCPCCCRRRGRLL.VliLI»I>VJi 

RGGBGGGGBGgGDKRRRRQARRCgRRPBPABARGGKMADVLSVL 

rqyniqkkbiwegdhvipgbfswpkbvktnywwgtgkegqpr


KYYn*DSILFLLNMVKL*SH FVYVRRAAXEWIPVVHRPDRKDLLG
YX^C3BASTSASIDESAPLBIGLQR3TQVKRAADHVLAKAKKPRI 
BDBBCVRLDKERIiRARIiRGHKEGIVQTBQIRSLSKfiMSVEKlAA 
IKfttCIMAK K K S T I B nf DL DDDITAIJCORSFVDABVPVTRDIVSRB 
RVWRTRTTI LQSTGINPSKNIFAilXJS VXARKKntfirBQRPAFN 
AAPVDPTIJtTKQPIPAAJroRYD^RPK CKiai T l^ygJLOl l CrryH 
C31TLKSVTBGASAHKTQTPAAQPVPRPV8QftRPPPSQKKGSRTP 
III IPARTTSl>ITMLN7lKDLIiQDI*ECFVPSDBKKKQGOQRE3®TL 
IQRRKD Qf tQ JGGTA ISVTWYRVVDQPLHlilPQpKDgVVKVFVQ 
GPAK^KOWPlOiLPIX^PVDIPAKIKTlFHI^YIIBVHLDPm^KW 
DVTVI£I»5yHKRHLDRPVFLRVWBTTiDRYWVKHKSHIiHF 



GNFWSA 

DI LSTIGTDHIIQHLSKOlKK CKBFKDFLKBRAAXHBRYGKDIiL 
NI>SRB3PCGOSBlim*KRAL3VFKQaVDHVAQCin;QLAQSLREE 
JUtKMBBTSBKQKLQRKKTBLIMDAIHRQICSLQFKRItaAXICIITB 
QKCRDKDKAEQAVSRSAKLVNPKQQKKLFVXIiATSKTAVEnSDK 
AYHLHIGTLDKVRBKWQSBHlKACEAFBAQBCBRINPPKHALWli 
HVNQLSQQCVTSDBMyBOVRKSbKKCSIQRDIBYFVNQRKTGQI 
PPAP rM Y BHT Y SSQKHAVPAGKATQPNIAHBGPLPI PKSSPDDP 
HYSLVDDYSLLYQ 



KBSVKUiHDWIjYLH 
RYlJAYPSBQBKLSIiSGQTNIjGVLQIQIWPlNARRRIjLPDMLRKD 
GKDPNQFTI SRRGGKA3DV7VLPRGSSPSVIAVSVPAPTHVLSLS 
VCSMPLHSOQGEKPAAPPPRGHIjBSPKPLVTPGSTLTLLTRAKA 
G3 PTGGIiPHTPPPTPPEQDKKDFSSTOLLVBVAIiQRAAEHKLQK 
QQDPSLPLLHTPIPLVSKMPQ 



PRLPVRRYHGRRRIOGRfiRGHMABGIXAGSDQRQNSBISAMAAlY 

C«BWCVI 3 H>C 3 UaFCIRISrnOTDPKWTL^^ 

P I YQLHAPHTiKGQBRADLSNSIiRBI Y I QNIGBS IL»YLWVEKI RD 



I #1 O) : -J mk :*-* :* ►i'i :■ ^ K*\ ft • r i ■ 



FDISBTKTEVEVBKLPPIDHOIPITDRRSTFOAHIAFVVCPKQV 
WflVLSKLYEHKKXASATEKIYAYR IYCEDKQTPLQDCBDDGSTA 
AGGRMjHUffilLHVKHVHVWSRWYGGII^PDRPKinitHCARN 
ILVEKNYTNS PEBSSKALCKNKKVRKDKKRNHH 



V i u : i m .i • ] f t \ : a v < ^ i ; i-im j>, ty >-sH >'» i ; i > ) .in :< t :■ * ; r;V^ t'j 



P HLP 

GEBHCVX DDCAKI PCIRISDDIDDPKWTLCLCIVMLPHBYPGTAP 
PIYQLHAPWLKGQERADLSNELEBIYIQNIOBSILYLHVBKIRD 



■ A T i f)(i):4>t A Z t y V ii T j 4*i*i*± LUi nif;T» 



PDIS B TRTBVH VE BLPPtDHGIPrrDRRSTFQAHIAFWCPICOV 

KMVLSKLYKNKKIASAIEHIYAYRIYCKDKQTFLQDCKDDGBTA 

AGOUiUBUffilUWKMVMVWSBNYGGlLtdGPDRFKHIHMCASN 

ILVBHHYTNSPBBS3KALGKHKKVRKDKKRNHH 



PRVRHAXTMAMSFBWPWQYRPPPFFTLQPNVDTRQKQLaAWCSL 
VLSFCRLHKQSSMTVKBAQBSPIiPHHTKLQRKIaP'VBS IQ I VLKE 
LRKKGHliBWICKSKSSFIdKHRRPKBWaKLIYQWVSRSGGNHSV 
PTLYBLTNGEDTKDKHFHGIxDBATLIiRAliQALOQKHKABI ITVS 
DCTRRQVIJACna»PLLLTSHI>SRAFICRHQTQCPPICTGSVTPPD 
SKGbQS 



KMAALTTLFKYIDEWQDRYIKKLAKWVAIQSVSAWP8KRGBIRR 
KMEVAAADVWQLGGSVBLVDIGKQKnPDGSBIPLPPILLGRLGS 
DPQKKTVCIYGHU)VQPAALBDGWDSKPPTLVERDGKLHERGST 
DDKGPVAGHINALKAYQICrGQHI PVNVRFCLBGMBESGSEGId}B 
IdFARKDTPFKDVDYVUiSDtryWbGKKKPCITYGLRGICYFPIK 
VBCSNKDLHSGVYGGSVHEAHTDLILUH3SI»VDKRGNII»IPGIH 
BAVAAVTKKBHSLYDDIDFDIKRFAKDVGftQXLLHSHKKDILMH 
RWRYPSLSLHSIBGAFSGSGAKTVIPRKVVGKFSIRLVPNMrPB








W0BQVT3Y1*TKKPABIi&SPMBPKVYMGEGGKPHVSDFSUPHY1j i 
WGHRAMKTVF0VBPDLTRKG6S I PVTLTFQSATGKNVMIlLPVCS 
ADDGAHSQHKKI201YHYIKGTKW1JVAYLYBVSQLKD 


6600 


2 


934 


PG£LKHVAANKBAGLEQUitBLLLPDTKRIRBATEQtjQIVLRAP 

AALSALCDLlAfiAADPQIiy)P^VLTTUlHI2rrRMRRliAABQRfiS 

LKSLXXiTALQRBTBHCVSLSLAQLaATIPRKKGLBAMPQLIjQIiL 
QHSTBSPH3 PKRBMGUJJ*SVWTSRPEAPQPHHRKTiTiRLf»KBT 
LGBVGSPGLIjPYSIjItrLTTlttPYI^STKDVPIJUUCLVPKlIKAMQ 
TT.TPTnRArar^nT.gaT.nKrj.v^^’JUTTPYi^KVT.Tvrri.RVAH 
HVALGHAIR IRIDCCLT7LVKVKS KALLKTOttilATLAAHPPPHC 
GC 


6601 


529 




PRAAARAP P PAVLRH DRRAATAPGA0BW7XHGPL»AQRYFI*NHIB 
K1 TTWQDPRKAMNQPLHHMMLH P AV39TPVPQHSMAVSQPHLVM 
NHC^Q(XJHAP9TL3QC^HPTC^PAGI^S«PNALTTQQQQQQKL 
HLQRIQHBRBRIRMRQBBLMRQBAAIjC3E0I*PMHABTLnFVQAAV 
MPPTMTPDNBSIlTOTSSDPFt)!iBGPYHSRE0STDS6LGLGCYSV 
PTTPBDPl*SWVPBC3TCgH7^GQTPMHINPQQTRFPDPXjDCLPGT 
NVDICTLBSBDLIPLPNDVBSALWKS2PFLTWX, 


6602 


127 


617 


IiIiDPPAliPKBVl^U^tKAOKP3T74TSMTOSl»HKVlK7vKTKRRNP 

HRVUGKITLVSAAPGKVICBHKVHBBHTHAIGTLHGGLTATIfVD 

HISTMMJiCTBRQAPGfVSVWfillTYKSPAKLGKDIVITAHVLKQ 

OKTtAFTSVDLTNKATOKLIAQGRHTKHIjGH 


6603 


79 


660 


PVGPSSLR^TGXiGHLPFliHRIjASSRGIjDMDLLQyLiAFtiFVljIi” - 

SG>*3ATGTLRTSIjDPSI*BIYKKMPHVKRRBQLIALKNIiAQIiNDI 

^YKrUJVMlJDGr^KVLKDSRTVL’ZAADVLPDGPTPODBXLKD 

7^SHWVKWEAFFGDVVLRFPRIVHYYPDHNSKW!ILI»IRKGISFC 

NQTCVPKQGPHSPILSM 


■ 


3 


688 


T9TA0RQGGBHMS PRGGGRGGFNRGGGGGaPNRGGSSNHFRGGG 
GGGCX5GHFRGGGRGGFGRGGGRGGPBKGQDQGPPBRVVX*LGBPL 

hpotodivckcttdbnkvpyfnapvylknkeqigkvdeifgqlr 

DFYFSVKLSKNNKASSPKEIiQKPYrDPYKLLPLQRPLPRPPaKK 
C5PPRGGGRGGRGGGRGGGGRGGGRGGG rRGGRGGGGGGFRGORG 
GGFRGRGH 




7 




8G8RRGAMRAAjOV01>VDCHCHLSAPDPOBDIiDDVI£KAKKAIiW 

ALVAVABHBGnaFBKIMQLSBRYNGnPVIjPCLGVHPVQGLPPEDQR 

SVTLKDIdTVALPIIEETYRDRLIAIGBVGLDFSPKFAGTGSQKEB 

OaQVLIRQIQIAKRIiNLPVNVHSRSAGRPTINIJjQBQGABKVliL 

HAFDGRPSVAMBGVHAGYFFS I PPS 1 1 K£GQQX£>V KQIiPLTS I C | 

LBTDSPALGFBKaVRNKPWNISISABYLfijQVKGISVKKVXBVTT 

QHALKLPPKLRHLLQK 


6606 


2 


1682 


PVSIRPRABVANI»3AHSASPI0IJAVI»KRIiSIJjKDIVYRQLNQIjS “ 

KSLGLIEOYOQRaXGOLPATLSPAKBBKRKDPHBESTOJSYIiSK 

KIS1DR3 1 PimPTKCKBLKYSKDLPOISI IFIPVHBALSVILR 

SVHSAVNHTPTHLLKEIXLVDDNSDBBELKVPLBBYVHKRY PGL 

VKVVRNQKRBGLIRARIEGVfKVT^TGQVTGFFDAHVBPTAJGWAKP 

VLSRlQBNRKRVIIJ?SIDinKQDNraVQRYENSAHCYBBEUfCM 

Y1 SPPKDWWDAGDPS LPIRTPAMIGCS FVVHRKFFGBIGLLDPG 

MDPYGGBNIBLGIKVWLOGGSWBVLPCSRVAHIBRKKICPYHSHI 

GPYTKRHAUWABVHHDDYKSHVYIAJfKLPLBOTGIDIGDVHBR 

RALRKSIJCCKNFQimiDHVYPKMRSYNHTV^YGELRNNKAKDVr 

LDQGPLBKHTAILYPCKSfGPQLARYTKKGPIiHI/5AI>GrTTTLLP 

DTRCLVDHSKSRLPOLIjDCDRVKSSLYKHWHFIC3*K5AIMNKGTG 



RCLBVKHRGLW3IMJCLRSCTGQRWTXKHSIK 

6607 137 98? VPACAGIJCXKAI^LIASPPRLl^KLOASCBALFSPPXQSRQTr 

GISFGGK3aRGPGVPTRT0VPAAMQAVK0TFSEI<yrKQRRPSKD 
KiroHI<BPffWVCHttPEGLBOIiJBaQTWFTHKBI42VI»YRGyKliBCP 



SGVVNBDTPKQlYiU)PPPHGIlft3TTAHnjraAFIXrTQTCiSVKFB

DFVTAlkSILLHGTVHBKLRWTFNLYDINKDGYIKQEEHMDIVKA 

ryDMMGKYTYPVLKBDTPKt^VDVPPQKMDHHKDGIVTLDHFIiB 

SCQEDDNIMRSLQLFQHVM 


6608 


224 


H40 


RPCFSSPTGI^RLSYPHXUiQHAVbPPPKOPSPSPPMSVMrRS 

CKLCNVTI2iSA00AQftHTDGKNHGKKIiRNYYAANSCPPPARMSTJ 
VVf^AflTPVVP VPPQMGSFKPGGRVI LftTENDYCKIiCIiASPSSP 
AVAQAHYQGHNHAKRLR1AEAQSNS PSESSKLGQRRARKKGNSF 
KMMPNRHHMYTVQJOISGPYFHPRSRQRIPHDLAMCVTPSGQFYC 
S«CNVGAGKIWEFRQHLBSXQfHKSKVSKQSYHNEMENI/JYV 




1 


443 


FHIiRCRH PH V7VGGJUJVQAGLRKSRVPAPBQRLSALTIXSW SAVT 
PAAEPGNFQLS PAEPRGPLASP'VRAAPEAPCPAAEMSKLHTECTS 
PATHOAAGQBKKGKftOmKASEKBBIDiroXAFBTHKAALAIO 
GXFBRFQKRHKDPSS 


6610 


319 




grkslcnlhipirfpltypdmymghmctakkcgirfqppaiiu; 

YESBIKGKTRQRIMPVRHFSKPSDCTRAABQLKBNPHHKSYtiEQ 
VSLBQLEKLPSFLRGYLSGQSLABTHEQIQRBTi'IDPEBDLNKL 
DDKBLAKRKS 1 MDBLPEKHQKKKDDPHPVID1BVHPPQDDQLQS 
CGWDTESADBP 


6611 


97B 


212 


PGCSGAGSRVWWIjPALRHLAMSSTBSSBGRSVSFGVDSBBBVRV 

LQGVRLSENWSRKKKPSSPPPAPTSSTFC3XPGNLRAPHKHST 

LPRSGSSGGQOPSGMKBGVKRYEQHHAAIff)KLFOVAKRKREAA 

TKHSKASLPTGEGSISHBEQKSVRLARELBSRBARLRRRDTFYK 

EQLKRIBRKNAKMYKLSSEQFHEAASKMBSriKPRRVBPVCSGL 

QAQILHCYRDRPHEVIiLCSDLVKAYQRCVSAAHKO 


6612 


1724 


992 


VSTHASALSRTQGOPOHOPRAAASGAGAGTAGGGGSGGABGSPCM 
STEAQR VDDS PSTSGGSSDGDQRESVOOB VERB QVQP KKKEG KI 
SSKZAAKLSTSAKRIQKBliABXTIiDPPPHCSAGPKtiDMiyCWRS 
TlLGPTGSVYHt^3VFFLDITP3PDYPFKPPECVTFRTRIYHClNTN 
SQGVI Crj>njmmj5PALTISKVLL31CSLLTDCNPADPLVGS I 
ATQYHTNRABHDRMARQHTKRYAT 


■ 


130 


748 


BLBLSSHMPBQSin)YRVAVFGAGGVGKSSLVLRFVKGTFRKSri 

PTVKDTYRQVXSCDKSICTLQITDTTGSHQPPAMORLSISKGHA 

PXLVYSXTSRaSLKHLKPITKOICBIK3CTV3SXPIMLVGHKCDK 

SPSXZBVQSSBAKAXARTHKCAFKBTSAKIilHRVRBLFQSLLHIaK 

KXUtTVSliQIDGKKSKQQKRSSKItKGKCTIM 


1 


3 


1191 


SSAAHAMR\^VRHCKGPPt>AHGARRGiU J SPQWRnuUUiGWBI>CR — 

DSRVRKKPPWimiFPGTDQPARBAXAALHAARENKRSBLXDKIiR 

VVTtffSP8PKOTiPVKQYAVQSQIiPVYKWPDVGSGBYDVGVVASF 

GRLIi7KAI*IIjKPPYGILNVHPSCLPRHRGPAP\r[HTVIjL-JEDTVT 

GVTIHQIRPKRFDVGPILKQBTVPVPPKSTAKKLEAV1»SRI/33U9 

MLISVI*KNLPKSI#SKGRQQPMBGATYAPKISAGT3CIKKKBQTS 

EQIFRLYRAlC^IIPLaTLWMANTIKLLDLVBVNSSVIJU3PKLT 

GQALI PGSVIYHKQSQniVYCiaJGHIGVRSVMLKKSLTATDPy 

NGYLH PWYQKNSQAQPSQCHPQTIALPTKKKQKKTVAKQQCIB 


1 


B32 




GRVCS^GASAMSKLPGDVRAFIiRKHPSLRLQIDARKVRCILTGHB 
LPCRli PKXjQVYTRGKKYQRIiVEAS pafdyae fsph ivpstknph 
QLFCKLT liRHINKCPEHVLRHTQGHfiyQfiALCKYHEOQKQGVEY 
VPACLVHRRRRRHDOMDGDGPIlPHHAFWEPTSSDHGGAASDnSW 
TDLYPPRLFTRKDICSTEDGDGTDDFL7DKEDKKAKPPRKKATD 
BGRRBTTVYRGLVQKRGKKQW5SUCKKPKSHHRKPKSPSSC3CQS 
O 


6616 


347 

1 


1886 


LLPPOQGARPLSS PPHASRDNLFLFWNCILCAPPH PS PQPIjQYP 
VffPLLLVITQlPAPRHLRHRPPSPSRGGLDSPSGSLSTPSICRS 



PAWVKMAPWPPKGLV PAV tA4CI*SLFI4nUPGPHfLQPSPPPQSSP
PPQPHPon'CRGLVDSKNKGLKRTIRDKFGGGNTAWKKENI.SKY 
KD SSTKLV E VL BGVCSKSDFHC nRIJJg fc S KBLVBSHWPHEQQBA 
PDLFQWLCSBSLKLCCPAGTFGPSCXPCFGGTRHPCGGYGQCEG 
BQTRGGSGHCDOQA0TGt3EACQQG8IXrfPBABStQ\SHLVCSACP 
GPCftRCSGP BR3Tff XQ CKK CWMiHHIjKCVD IIXBO OTE GA HOG3U> 
QFCVHTEGSYBCRDCAKftCbGOCAGPGRCPDCS PGT fQqVGSKC 
Ijr y D RC BTK V CPGEHKOCBI3TBGQYRCI CftHGYKQHEPICVKEQ 
IPESAGFFSEWTBDKLVVLQQMFFGII I CAIATLAAKGDLVFTA 
IPTGAVAAMTGYWLSERSDHVLEGFIKGR 



WMANQVSLliBLBDBLQCPICIiEVFKBSUfliQOGUSYCKBCbVS 

LSYHLDTKVRCpKCHOAVDGSSSLPHVSLAWIKALRLPGDPBP 

KVCVHHHHPLSIjFCZKKIX}KLXCGXiCX3ajI>GSHQHHPVTPXSTVCS 

EKECHSLAALFSBIiKQEOKKVDEI*XAKLVKHKTRIDGSAPSIjCPC 

LGPATFTFI* 



DGKvARRAPKSPAFQrrorYPLVSAPRATTftKSpWSKVLQNlyCR 
NVPKMTSBRSKI PCLSAAAABGTG10CQQBGBAMATIJ3RKVPSPR 
APLGRPHSSWIElAAKLHCSDNVDIiKBAGKBGGlCSBKVMRLWKEA 
WKYOT 



PASS BVLTAAVHFUiLNCIVAV SQHMGIGKKGBUPRPPbRMEFR 
YFQRHTTTSSVBGKQNLV IMGRJCT1TFS IPKKNRPLKDRINLVLS 
SBLKBPPQGAHFLAHSlJDrALKLTBRPEIiAHKVXXflWIVGGSSV 
YKKAMNHLGHLKLFVTRIMQDPRSDTPFSBIDLRKYKLI»PKypG 
rUSDVQEG KKI KYKFKVCKKDD 



NSRVWJFVARARMAABNBA8QBflALGfty8PVDYK3ITSFPRLPB 
DKPT^PAAPLRGRKDEDAFLGDPDTDPnSPLKSARIiQRLPSSSSB 
MGSQDGSPLRHTRKD P PSAAAASCSCRQXGlfTVTVTACIiTFATG 
VTVALVPWXTFGDPQIFQOTAVVTnAARCTSLGIBVI^KQGSSV 
DAAVAAALCliGTPAPBSSGliGGGGVMLVHDXRRNBSHLIDFRBS 
APGALRBETLQRSHETKPGLLVOVPGHVSGLHRAI^LYGRLPWS 
QVliAPAAAVAQDGFNVTHDIAKArABQLPPHMSKRPRBTPLPSG 
BPPLPGSLLHRPDLftBVLDVLGTSGPAAPYAGGNLTIiHMVAKAQ 
HAOOVITBEDFSHYSALVBKPVQGVYRGHLVLSPPPPHTGPALI 
SALHniBGPNLTSLVSREQJU*Hlf 7 ABTI«KlAIAIiASBJUS}PVYP 
STlT6SMDDHLSSVBAAYLRGHlKDSQAAPAP!UjPVYBl>DGAPT 
AAQVUMGPDDFIVAMVSSIJUQPPGSGUTPSGIIiitfSWLDPS 
WPWRTANHSAPSLRNSVQPGKRPLSPliLPTVVRPAEGLOGTYIA 
LGANGAABGLSGLTQVRFTPWXAFFSBBPS OGLDCRCLS YLMLV 
SIPHAANHG 



VQG I TSYQQKLQALRKB KSRHAARS 
AAITSQI*DKASIIRLTrSYIiKMRDFAHQGDPPWNIjRMBGPPPNT 
SVKVIGAQSRRSPSAIAIBVFBAHLGSHILQSLDaYVFALNQEG 
KFLYTSETVSrniJliSQVEIjTGSSVFDTVHPGDHVBMAEQIJSflK 
LPPGRGIiLSQGTAKDGASSASSSSQSBTPBPWCFPPASDQFLIi 



* ^Vc) J r.iTT^W 4 iH !'d ? i 



AEVGKSd X KR YCBKRPVSKyUMTGIDYGVTKVHVRDRBIKVH 
IFDMAGHPFFYBVRKPF 



KALFEKVKKFRLHVKEGDII»YAMYVRQTVI»KVIKFIiI I IAYNSA 
LVSKVQFTVDQ?VDIQDWXt3Y K HFSCHHTWRHLFSKLS FCYLCF 
VSXYGLTCLYTLYWLPYRSXiRBYSFByVRQBTGnJDIPPVKNDF 
afmlumtdqydplyskrfavflsbvsrhkleqinlnnewtpdkl 
RQK bQTlCVBHRLELPLIHl£Gl«PDTVFEITEI^Sl^LBX IKNVM 
IPATXAQLDNLQBIjSIiHQCSVKIHSAALSPLKEHIiKVLSVKFDD 
MRSLPPWMYGIiRNX*BBLYI*VGSlkSHDI3RNVTLSSXiRI)I*KBLKl 
hS I KSNVS KIP^VWVSSHLQJWniiNIXTrKLVMlliNLKlCMTN 
IiTBLBLVHCX>IiHRIPHAVFSIiI>SLQBliDLKEllNUCSIBBXVSFQ 






HLRKLTVLKLWHNSITYTPBHlKKLTSLKaLSFSHWKXKVLPSH 

LFLCHKUlYLDLSYHDIRFIPPBIGVLOSLQYFSITaaXVESLP 

DKLYFCISQjKTLKIGKNSIiSVLSPKIGNIiLPLSYLDGKGNHFBI 

I*PPKIi35C3?ALKSAGLWKDALPKTLPSIJVRHQp<KTB 



GSRRGGGSaiPAVSTHVAPGHSVbHPPASGAIiRIfiSLVIO^GOC 
EGRPSGLAHLSQffrSHKRAHRSGRACXGDPPGKIIxHS FIMKCTA 
RBHZAVCTVLFMAXAIPAMVVPHATLIiBXLIiBXYKDEDGEVfWlA 
KQRGKRAITDHDMQSIIiDLHNKLHSQVYPTASWlSTMTWTJVHLB 
RSABSWA BSCLWBH GPASUiPS IG»OjGAH»GRYHPPTPHVQSW 
YDKVKDPS YPYEHHCNP YCPPR CSGPVCTHYTQfWVKATSNftlGC 
AINLOiHMHIVX?QIWPKAVYI,VCHYSPB3SWHGHAPYKHGRPCS 






HVHTRSDDSgKNgVISAQQHSQ I VS CBVRLRDQC Kgi T CMR Y BC 
PAGCU3SKAXV1GSVHYEMQSS 1CRAAIHYG1 IDKDGGWVDXTR 
QGRKHYFIKSHRNGIQTIGKYQSAH5FTVSKVTVQAVTCKTTVK 
QLCPPHKPASHCPimCPRJXYASKSTLCSareiJSSLP 



PGPRGGGGS 

GPLVPSTSDIMLLKNIBI*FDKLALIlFHGRI*I*FLKDVIiGDB ICCW 
SFYGQGRKXAEVOC7reiVYATKXK QTK V HFPK ARXPBBTliNlI*I 
YB T PR GPDPALLgflJOGAAGAGQAORGBD BBN R EH H V RRlHVRR 
BITHDBRPHGOQIVFKD 



SAVBFVYTDRPHblLGlS VBFLCSLRSDATMES ITACLHALQAL 
LDVPWPRSKIGSDQDSGIKLXiBVLHRVILTRBSPSIQLASLKW 

rqiicaaqehvkekhrsaevddoaabkbtlpbfgegkdtgglvp 
GKS LV PAT1JSLCVCILVRQLPE1HPKLTGSPGVKATKP0I LLED 
G&RLVSAALVILSKLPAVCSPEGSXSILPTILYLTIGVItRBTAV 
KLPGGQbSSTVAASLQALIUSIIiSSPMARTiEKSRTAWTDIilaRSAI* 
rr I LraroPVDBTHQBLDBVSIiL'rAITVFIbSTS PBVTT I PCLQ 
KRCIDKFKATLKIKDPVVQIKTYQIiIjHSrFQYPNPAVSYPYIYS 
LASGCMBKLQSIDKRKPBKTABLBI FOEGIKVLEl’LVTVAKEHH 
RAQLVACLIiPX MSPIdiDKGSIXJSATSIMHHIiHDpaLQHLMQIG 
PQYSSVPKSLVASSPALKARLKAAIKCaiQBSVKVKZPTSKYTKS 
PGKNSSIQLKTSFL 



G IPHLSSR11MTGTPGAVATRDGKAPKRSPPCS PS YDLTGKVMLL 
GinGVGKICPItlQFKDGAFLSGTFIATVGlDFREnCVVTVDGVRV 
KLQlHDTAGOBRPRSVTHAYYRDAQAIiIiIiLYDmiKSSFI»?IRA 
HLTB IHKYAQRD VVTmi/SNK7\D«SS BR VXRSKDGBTLAREYCV 
PFIiBTSAKTOWVBIAPLAIAKBLKYRAGHQADRPSFQIRDYVB 
SQKKHSSCCSFM 



KAN 

KBPGDSLSLBILQI IKBSQQQHGLREGX)F®YRGTfCSRHQRRIjR 
KTLNFKM(SiRHKPTGKBVTBBIiLTIHlRYI»IiLVLMDAKRAHSY7VM 
QLKQBANTBPRKRFHLJUSRLRKAVKHAEEIiBRIjCBSNRVBAKTK 



EOAVbYN^VBEISPNIHYCAYNIGDOSAINBLMQMHLRSGGTB 
gllakklealitqtrakoaatmsbvbwrcttvpvkidkvr I FLL 
GXJUINEAAXVOABSBBTKKRbPBSMLSECHDAIQVVRKSLKPDQ 
KQROYIliBGEPGKVSNLQYUISYtiTYIKLSTAIKRNBNHAKGLQ 
RALLOCW PKDDSKRSPRPQDLI RL YD I ILQNLVELbQLPGIiEED 
KAFQXBIGUrrLVFKAYRCP PIAQS YVLVKKW3EALVLYDRVLK 
XANBVNSDAGAFKNSliKDLPDVQEbITQVRSEKCSLQAAAILDA 
HDAHQTKTSSSQVKDNKPLVKRFETPCtiDPSLVTKOANLVHPPP 
GFQPIPCXPLPFDLALNHVAFPPbEDKLBQKTKSGLTGYTKOIF 
GPRS 



GAJPbGSVGGRTGKMnAATLTYDTLRPABPBDPPBTSEFVWILG 
RKYSI FTKJO>BIl>SlJVASRLWPTYRKNPPAIGGTGPTSDrrGWGC 






MUlCCK>lIPAQAbVaiHIX?HDWRHTQRKRQPDSYPSVlf3AFIDR 
KDSYYSIHQIAttfatVOBGKSIGQWY Gt WX V AQVliKKIAVFTyrWS 
SLAVHXAMHiTVVMEBI RXIjCTTS VPCAGATAFPADSDRHCNGP 
PAGAKVTEUtPSPWRPLVLIiIPIiHLGLTDlNKAYVErLKBCPIMP 
QSLOyiGGKraSAHypIGYVGBgLrnjDPHTTQPA VBP TD GCFI 
PDBSFE0QHPPC3MS1AKLDPSIAVVRGC3HLSTQAPGABC!CLGM 
TRKTFGPLRFFPSMLO 



LVQCOTIKRRSAnGAMPGKHVSRVRAIiYKRVLQIiHRVIiPPDLKS 
IGDQYVKDBFRRHKTVGSDBAQpFLQ KWKVYATAbUXJAHHNHQ 
HSTCKACFGTPLPSKKLHDPRDBQIGQLQHLWQKATKFNHQFSl 
SBSMXPKF 



IfVQOGGJZtHRSAHGAMPGRHVSRVRAZfVRBVZfQZiHRVLPPDLKS 

IGDQVVKDSF&EtHKTVGSDEAQRFI^QENBVfAXALLQOAHENRQ 

NSTGKACFGTFI^KEKLHDFRDBQJGQIiQKLMQEATKPlIRQFSI 

S BS HKPKP 



IWSSGKIOBGAAPIjAPAAAMXAVVQrVTkASvTVGGEQISAIGR 
GICVLLGXSLEITrQESLSHMVRKILNLRVPBDESGKHHSKSVMD 
KQYEILCVSQFTLQCVIJB33KPDFH1AMPTBQABGFYHS fleol 
RKTYRPSl*rRDGKFGAXHQVHl(9IDGP\nnELBSPAPG7AI6DP 
KQLSKLSKQQQRKEKTRAKGPSBSSKKRNTPHKKDSaASSGABG 
DVSSBRKP 



PTLKGI IQQLVNGX XTPATI PSIiG PffGVbBSNPMDY 
AKGANGLDAI ITQLI/JQFHNTGPPPADKKKIOALPTVPVTEKHV 
GSGLKCPVCKDDZAUCERVRQLPCNHLPHDGC XVPWLBQHDSCP 
VCSK$Llt3Qf?TATJIPPGLTGVSFSSSSSSS$SSSPSNKHATSNS 



OGGIPRJCGSGPRBilIiPMAHijRDCLiPRLMLT3jRSIiLFWSI*VYCYC 
GLCAS IHLLKLLW6LGKGPAQTFRRPAREHP PACLSDP3LGTHC 
yVRIKDSGLRPHTVAftGJBHGKPLMLlaLfiGFPHFWTSWRy'OUlBF 
KSBYHWAIiDLHGYGfiTDAPIHRQNYiajXXrrDIKDIIiDSIjGY 
SK(AfrIGHDIiGGMIAHL2AICYFBHVMKZ>IVINFPHPNVFTEYT 
T AHPAQUUCSSYTY7FQI PWFPEFMFS INDFKVLKHLFTSHSTG 
IGRKGCQLTTKDliEAYXYVFSQ PGAI^GPIKHYRNI FSCLPLKH 
HMVTTPTtliW*GEKnAFMHV^MASVTRFWKNYFRI/rXI*SBASH 
JfLMDQPDIVNKLrH’rFLKBBTRKKD 



BMHAGQQLASMXiKWTRAHRLPREGXjGPHGPSFAHVPVAPSSSSG 
CStGGABPRPLPI*STRX£HXJEAAIiPAVVyi«HGI*FOS KTNFHS IAK 
IIAQQTGHJRVbTVDARNHGDS FHSPDMS YEXKSQDLQDLIiPQliG 
LVPCVWGHS5S3G1 TAMLLAIjQRPKI.VKRLIAVDXS p vbstgvs 
HFATTVAAMRAH ilAOELPRSRARBZADSQbSSVIQDMAVRQRZ* 
!JHLVBVDGSFVRRVHlJALTQilLDKlJLAFPQBQBSYLGPTLPI» 
LGGNSQFVHPBHHPEDCUtFPRAQnQTFPHAGHfVIHADRPQDPI 
AAIROFLV 



SFCMFSHKQJQSHFQAVPVQBKKKRLRHAPWRAFAQPQRIiKHPAK 

QPIVRQCLQBPPbOGfVbGPVQOQDUPPSLGPVliSPHSDPGWCRVD 

OGGDGVF 



CSSSPCFHIXrrCVlJDKAGSYKCACIAGYTGQRCKNU^AGKSKl 
KASBDSLSVIBBEHCSDPGGIW5YQKITGGPGLIHGRHAKIGT 
WSPFCKWSyVLSGNEKRTCQQNGEWSGKQPICIKACRBPKISP 
LVHRRVLPKQVQSRBTPLHQLYSAAFSBQRIjQSAPTKKPALPFG 
DLPM3IQHLHTQIOYKCXSP FYRRI/5SSRRTCLRTGXKSGRAPS 
CIPIOGKIEHITAPKTQGLRWPWQAAIYRftTSGVHDGSLHKlGAIJ 
FLVCSGALVNBRTVVVAAHCVTDLGKVTMIKEADLKVVLGKFYR 
DDD3U)BKTIQSLQISAXILHPNYDPILLDADlAIItKLLDK7mXS 
TRVQPICIAASRDLSTSFQBSHrTVAGWNVLADVRSPGPKNDTL 
RSGWSVVDSXiLCBBQHKDHGIPVSVTDIJMFCASWBP'EAPSDIC 
tabtggiaavsfpgraspbprwhlmgi*vswsxdktcshri*staf 